METHODS FOR GENERATING HYPERMUTABLE MICROBES

This application claims the benefit of provisional application S.N. 
60/181,929 filed February 11, 2000. 

FIELD OF THE INVENTION 

The invention is related to the area of mismatch repair genes. In
particular it is related to the field of in situ mutagenesis of single-celled
organisms.

BACKGROUND OF THE INVENTION 

Within the past four years, the genetic cause of the Hereditary
Nonpolyposis Colorectal Cancer Syndrome (HNPCC), also known as
Lynch syndrome II, has been ascertained for the majority of kindreds
affected with the disease (14). The molecular basis of HNPCC involves
genetic instability resulting from defective mismatch repair (MMR).
Several genes have been identified in humans that encode proteins that
appear to participate in the MMR process, including the mutS homologs
GTBP, hMSH2, and hMSH3 and the mutL homologs hMLH1, hMLH3,
hPMS1, and hPMS2 (4, 9, 11, 17, 19, 22, 24, 38). Germline mutations in four
of these genes (hMSH2, hMLH1, hPMS1, and hPMS2) have been identified
in HNPCC kindreds (13). Though the mutator defect that arises from the
MMR deficiency can affect any DNA sequence, microsatellite sequences
are particularly sensitive to MMR abnormalities (14, 25, 27, 29).
Microsatellite instability (MI) is therefore a useful indicator of defective
MMR. In addition to its occurrence in virtually all tumors arising in
HNPCC patients, MI is found in a small fraction of sporadic tumors with
distinctive molecular and phenotypic properties (13).

HNPCC is inherited in an autosomal dominant fashion, so that the 
normal cells of affected family members contain one mutant allele of the
relevant MMR gene (inherited from an affected parent) and one wild-type
allele (inherited from the unaffected parent). During the early stages of 
tumor development, however, the wild-type allele is inactivated through a 
somatic mutation, leaving the cell with no functional MMR gene and 
resulting in a profound defect in MMR activity. Because a somatic 
mutation in addition to a germ-line mutation is required to generate 
defective MMR in the tumor cells, this mechanism is generally referred to 
as one involving two hits, analogous to the biallelic inactivation of tumor 
suppressor genes that initiate other hereditary cancers. In line with this 
two-hit mechanism, the non-neoplastic cells of HNPCC patients generally 
retain near normal levels of MMR activity due to the presence of the 
wild-type allele (11, 13, 24). In addition, similar findings are observed in
other diploid organisms (2, 5, 8).

The ability to alter signal transduction pathways by manipulating
a gene product's function, either by over-expression of the wild type
protein or a fragment thereof, or by introduction of mutations into specific
domains of the protein (the so-called dominant-negative inhibitory
mutant), was described over a decade ago in the yeast
Saccharomyces cerevisiae by Herskowitz (Nature 329:219-222, 1987). It
has been demonstrated that over-expression of wild type gene products can
result in a similar dominant-negative inhibitory phenotype, most likely due
to the "saturating-out" of a factor, such as a protein, that is present at low
levels and necessary for activity; removal of the protein by binding to a
high level of its cognate partner results in the same net effect, leading to
inactivation of the protein and the associated signal transduction pathway.



Recently, work done by Nicolaides et al. (32) has demonstrated the
utility of introducing dominant negative inhibitory mismatch repair mutants
into mammalian cells to confer global DNA hypermutability. There is a
need in the art for additional techniques for generating mutations in
bacteria which can be used to make strains for production, biocatalysis,
bioremediation, and drug discovery.

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a method for 
rendering bacterial cells hypermutable.

It is another object of the present invention to provide genetically 
altered bacteria. 

It is yet another object of the present invention to provide a method 
to produce bacteria that are hypermutable. 

It is an object of the invention to provide a method to inactivate the
process that results in hypermutable cells following strain selection.

It is a further object of the invention to provide a method of 
mutating a gene of interest in a bacterium. 

These and other objects of the invention are provided by one
or more of the embodiments described below. In one embodiment, a
method is provided for making a hypermutable bacterium. A polynucleotide
comprising a dominant negative allele of a mismatch repair gene is
introduced into a bacterium, whereby the cell becomes hypermutable.
Preferably the allele is under the control of an inducible transcription
regulatory sequence.

According to another aspect of the invention a homogeneous
composition of cultured, hypermutable bacteria is provided. The bacteria
comprise a dominant negative allele of a mismatch repair gene. Preferably
the allele is under the control of an inducible transcription regulatory
sequence.



Another embodiment of the invention provides a method for
generating a mutation in a gene of interest. A bacterial culture comprising
the gene of interest and a dominant negative allele of a mismatch repair
gene is grown. The cells of the culture are hypermutable. The culture is
tested to determine whether the gene of interest harbors a mutation.
Preferably the allele is under the control of an inducible transcription
regulatory sequence.

According to still another aspect of the invention a method for
generating a mutation in a gene of interest is provided. A bacterium
comprising the gene of interest and a dominant negative allele of a
mismatch repair gene is grown to form a population of mutated bacteria.
The population of mutated bacteria is cultivated under trait selection
conditions. At least one of the cultivated bacteria is tested to determine that
the gene of interest harbors a mutation. Preferably the allele is under the
control of an inducible transcription regulatory sequence.

Still another aspect of the invention is a method for enhancing the
mutation rate of a bacterium. A bacterium comprising a dominant negative
allele of an MMR gene is exposed to a mutagen whereby the mutation rate
of the bacterium is enhanced in excess of the rate in the absence of mutagen
and in excess of the rate in the absence of the dominant negative allele.
Preferably the allele is under the control of an inducible transcription
regulatory sequence.

Yet another aspect of the invention is a method for
generating an MMR-proficient bacterium with a new output trait. A
mismatch repair deficient bacterium comprising a gene of interest and a
dominant negative allele of a mismatch repair gene is grown to form a
population of mutated bacteria. The population of mutated bacteria is
cultivated under trait selection conditions. At least one of the cultivated
bacteria is tested to determine that the gene of interest harbors a mutation.
Mismatch repair activity is restored to the at least one of the cultivated
bacteria. Preferably the allele is under the control of an inducible
transcription regulatory sequence.

These and other embodiments of the invention provide the art with
methods that can generate enhanced mutability in bacteria, as well as
prokaryotic organisms harboring potentially useful mutations that
generate novel output traits for commercial applications. The ability to
create hypermutable organisms using dominant negative alleles has great
commercial value for the generation of innovative bacterial strains that
display new output features useful for a variety of applications. These
include, but are not limited to, use in the manufacturing industry to
generate new biochemicals useful for detoxifying noxious chemicals from
by-products of manufacturing processes, or biochemicals used as catalysts,
as well as remediation of toxins present in the environment, including but
not limited to polychlorobenzenes (PCBs), heavy metals and other
environmental hazards that need to be removed from the environment. In
addition to obtaining organisms that are useful for removal of toxins from
the environment, novel microbes can be selected for enhanced activity to
produce increased quantity or quality of a protein or non-protein
therapeutic molecule by means of biotransformation (3).

Biotransformation is the enzymatic conversion, by a microbe or an extract
derived from the microbe, of one chemical intermediate to the next product.
There are many examples of biotransformation in use for the commercial
manufacturing of important biological and chemical products, including
Penicillin G, Erythromycin, and Clavulanic Acid, as well as organisms that
are efficient at conversion of "raw" materials to advanced intermediates
and/or final products (Berry, A. Trends Biotechnol. 14(7):250-256). The
ability to control DNA hypermutability in host bacterial strains using a
dominant negative MMR (as described above) allows for the generation of
variant subtypes that can be selected for new phenotypes of commercial
interest, including but not limited to organisms that are toxin-resistant, have
the capacity to degrade a toxin in situ, or have the ability to convert a
molecule from an intermediate to either an advanced intermediate or a final
product. Other applications using dominant negative MMR genes to produce
genetic alteration of bacterial hosts for new output traits include but are
not limited to recombinant production strains that produce higher
quantities of a recombinant polypeptide, as well as the use of altered
endogenous genes that can transform chemicals or catalyze downstream
manufacturing processes.

This application teaches the use of a regulatable dominant
negative MMR phenotype to produce a prokaryotic strain with a
commercially beneficial output trait. Using this process, microbes
expressing a dominant negative MMR can be directly selected for the
phenotype of interest. Once a selected bacterium with a specified output
trait is isolated, the hypermutable activity of the dominant negative MMR
allele can be turned off by several methods well known to those skilled in
the art. For example, if the dominant-negative allele is expressed by an
inducible promoter system, including but not limited to promoters such as
TAC-LACI, tryp (Brosius et al., Gene 27:161-172, 1984), araBAD
(Guzman et al., J. Bact. 177:4121-4130, 1995), pLex (La Vallie et al.,
Bio/Technology 11:187-193, 1992), pRSET (Schoepfer, R., Gene
124:83-85, 1993), pT7 (Studier, J. Mol. Biol. 219(1):37-44, 1991), etc.,
the inducer can be removed so that promoter activity is reduced.
Alternatively, a system that excises the MMR gene insert from the host
cells harboring the expression vector, such as Cre-lox (Hasan, N. et al.,
Gene 2:51-56, 1994), can be used, as can methods that knock out the
expression vector by homologous recombination. In
addition to the recombinant methods outlined above that have the capacity
to eliminate the dominant negative MMR activity from the microbe, it has
been demonstrated that many chemicals have the ability to "cure"
microbial cells of plasmids.
For example, chemical treatment of cells with drugs including bleomycin
(Attfield et al., Antimicrob. Agents Chemother. 27:985-988, 1985) or
novobiocin, coumermycin, and quinolones (Fu et al., Chemotherapy
34:415-418, 1988) has been shown to result in microbial cells that lack
endogenous plasmid, as evidenced by Southern analysis of cured cells as
well as sensitivity to the appropriate antibiotic (1, 41-43). Whether by use
of recombinant means or treatment of cells with chemicals, removal of the
MMR-expression plasmid results in the re-establishment of a genetically
stable microbial cell line. Therefore, the restoration of MMR allows host
bacteria to function normally to repair DNA. The newly generated mutant
bacterial strain that exhibits a novel, selected output trait is now suitable for
a wide range of commercial processes or for gene/protein discovery to
identify new biomolecules that are involved in generating a particular
output trait.
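
The on/off logic described above can be pictured with a small toy model.
The sketch below is illustrative only and is not part of the disclosed
method: the class name Strain, its fields, and the simplifying assumption
that a strain is hypermutable exactly when the dominant negative allele
is expressed are all hypothetical.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Strain:
    """Toy description of a host carrying an inducible dominant negative MMR allele."""
    has_dn_plasmid: bool   # carries the dominant negative MMR expression vector
    inducer_present: bool  # inducer for the promoter is in the culture medium

    @property
    def dn_allele_expressed(self) -> bool:
        # Expression needs both the vector and the inducer (inducible promoter).
        return self.has_dn_plasmid and self.inducer_present

    @property
    def hypermutable(self) -> bool:
        # Simplifying assumption: hypermutable exactly when the allele is expressed.
        return self.dn_allele_expressed


def remove_inducer(strain: Strain) -> Strain:
    """Model withdrawal of the inducer from the culture medium."""
    return replace(strain, inducer_present=False)


def cure_plasmid(strain: Strain) -> Strain:
    """Model loss of the expression vector (excision or chemical curing)."""
    return replace(strain, has_dn_plasmid=False)


induced = Strain(has_dn_plasmid=True, inducer_present=True)
stabilized_by_withdrawal = remove_inducer(induced)
stabilized_by_curing = cure_plasmid(induced)
```

In this toy model either route (inducer withdrawal or loss of the
plasmid) returns the strain to a non-hypermutable state, which mirrors
the alternatives listed in the text.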

While it has been documented that MMR deficiency can lead to as
much as a 1000-fold increase in the endogenous DNA mutation rate of a
host, there is no assurance that MMR deficiency alone will be sufficient to
alter every gene within the DNA of the host bacterium to create altered
biochemicals with new activity(s). Therefore, chemical agents
and their respective analogues, such as ethidium bromide, EMS, MNNG,
MNU, Tamoxifen, and 8-Hydroxyguanine, as well as others listed in, but not
limited to, publications by Khromov-Borisov, N.N., et al. (Mutat. Res.
430:55-74, 1999); Ohe, T., et al. (Mutat. Res. 429:189-199, 1999); Hour,
T.C., et al. (Food Chem. Toxicol. 37:569-579, 1999); Hrelia, P., et al.
(Chem. Biol. Interact. 118:99-111, 1999); Garganta, F., et al. (Environ.
Mol. Mutagen. 33:75-85, 1999); Ukawa-Ishikawa S., et al. (Mutat. Res.
412:99-107, 1998); www.ehs.utah.edu/ohh/mutagens, etc., can be used to
further enhance the spectrum of mutations and increase the likelihood of
obtaining alterations in one or more genes that can in turn generate host
bacteria with a desired new output trait(s) (10, 39, 40). Prior art teaches
that mismatch repair deficiency leads to hosts with an increased resistance
to toxicity by chemicals with DNA damaging activity. This feature allows
for the creation of additional genetically diverse hosts when mismatch
defective bacteria are exposed to such agents, which would otherwise be
impossible due to the toxic effects of such chemical mutagens [Colella, G.,
et al. (Br. J. Cancer 80:338-343, 1999); Moreland, N.J., et al. (Cancer Res.
59:2102-2106, 1999); Humbert, O., et al. (Carcinogenesis 20:205-214,
1999); Glaab, W.E., et al. (Mutat. Res. 398:197-207, 1998)]. Moreover,
prior art teaches that mismatch repair is responsible for repairing
chemical-induced DNA adducts, so blocking this process could
theoretically increase the number and types of mutations, the mutation
rate, and the genomic alterations of a bacterial host [Rasmussen, L.J.,
et al. (Carcinogenesis 17:2085-2088, 1996); Sledziewska-Gojska, E., et al.
(Mutat. Res. 383:31-37, 1997); and Janion, C., et al. (Mutat. Res.
210:15-22, 1989)]. In addition to the chemicals listed above, other types
of DNA mutagens, including ionizing radiation and UV irradiation, which
are known to cause DNA mutagenesis in bacteria, can also be used to
potentially enhance this process. These agents, which are extremely toxic
to host cells and therefore result in a decrease in the actual pool size of
altered bacterial cells, are better tolerated in MMR defective hosts and in
turn allow for an enriched spectrum and degree of genomic mutation (7).

This application teaches new uses of MMR deficient bacterial cells
to create commercially viable microbes that express novel output traits.
Moreover, this application teaches the use of dominant negative MMR
genes to decrease the endogenous MMR activity of the host, followed by
placing the cells under selection to obtain a desired output
trait for commercial applications such as but not limited to recombinant
manufacturing, biotransformation and bioremediation. Furthermore, the
application teaches restoring MMR activity to the hypermutable
bacterial host following strain selection of the variant of interest as a means
to genetically "fix" the new mutations in the host genome. The application
also teaches enhancing hypermutability in bacteria by combining
MMR deficiency with chemical or radiation mutagenesis to create variant
subtypes of bacteria useful for commercial and other applications. The
application describes uses of hypermutable bacteria for producing strains
that can be used to generate new output traits for chemical manufacturing,
pharmaceutical and other commercially applicable processes.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Western blot of steady-state PMS134 levels in IPTG-treated
samples in DH10B strain. Cells containing the pTACPMS134 (lane 2)
showed a robust steady state level of protein after induction in contrast to
cells expressing empty vector (lane 1). Blots were probed with an
anti-human-PMS2 polyclonal antibody.

Figure 2. Western blot of PMS134V5 and PMSR3V5 in IPTG-treated (+)
and untreated (-) samples in BL21 strain. Blots were probed with an
anti-V5 antibody, which is directed to the C-terminal tag of each protein.

Figure 3. Number of Kanamycin resistant PMS134 and vector control
DH10B clones. IPTG-induced strains were grown and plated onto KAN
plates and grown for 18 hours at 37°C to identify number of KAN resistant
clones due to genetic alteration.

Figure 4. Number of Kanamycin Resistant PMS134, PMSR3 and vector
control BL21 clones. IPTG-induced strains were grown and plated onto
AMP and KAN plates and grown for 18 hours at 37°C to identify number
of KAN resistant clones due to genetic alteration.






Figure 5. (A) Western blot of steady-state ATPMS134flag in IPTG-treated
samples in DH10B. Lysates from untransfected cells (lane 1) and a
bacterial clone expressing the Arabidopsis thaliana PMS134 truncated
protein with a FLAG epitope fused to the C-terminus (ATPMS134flag)
(lane 2) were electrophoresed on SDS-PAGE gels. Blots were probed with
an anti-FLAG monoclonal antibody directed to the FLAG epitope. (B)
Number of Kanamycin Resistant ATPMS134flag and vector control
DH10B clones. IPTG-induced strains were grown and plated onto AMP
and KAN plates and grown for an additional 18 hours at 37°C to identify
number of KAN resistant clones due to genetic alteration.

Figure 6. Generation of high recombinant producer BGAL-MOR lines in
PMS134 expressing DH5alpha host strains.

DETAILED DESCRIPTION OF THE INVENTION 

The inventors present a method for developing hypermutable
bacteria by altering the endogenous mismatch repair activity of
hosts. Wild type and some dominant negative alleles of mismatch repair
genes, when introduced and expressed in bacteria, increase the rate of
spontaneous mutations by reducing the effectiveness of the endogenous
MMR-mediated DNA repair activity, thereby rendering the bacteria highly
susceptible to genetic alterations due to hypermutability. Hypermutable
bacteria can then be utilized to screen for novel mutations in a gene or a set
of genes that produce variant siblings that exhibit a new output trait(s) not
found in the wild type cells.

The process of mismatch repair, also called mismatch proofreading,
is an evolutionarily highly conserved process that is carried out by protein
complexes described in cells ranging from prokaryotes such as
bacteria to more complex mammalian cells (14, 29, 31, 33, 34). A
mismatch repair gene is a gene that encodes one of the proteins of such a
mismatch repair complex. Although not wanting to be bound by any
particular theory of mechanism of action, a mismatch repair complex is
believed to detect distortions of the DNA helix resulting from
non-complementary pairing of nucleotide bases. The non-complementary base
on the newer DNA strand is excised, and the excised base is replaced with
the appropriate base that is complementary to the older DNA strand. In
this way, cells eliminate many mutations that occur as a result of mistakes
in DNA replication, resulting in genetic stability of the sibling cells derived
from the parental cell.

Some wild type alleles as well as dominant negative alleles cause a
mismatch repair defective phenotype even in the presence of a wild-type
allele in the same cell. An example of a dominant negative allele of a
mismatch repair gene is the human gene hPMS2-134, which carries a
truncation mutation at codon 134 (32). The mutation causes the product of
this gene to abnormally terminate at the position of the 134th amino acid,
resulting in a shortened polypeptide containing the N-terminal 133 amino
acids. Such a mutation causes an increase in the rate of mutations, which
accumulate in cells after DNA replication. Expression of a dominant
negative allele of a mismatch repair gene results in impairment of
mismatch repair activity, even in the presence of the wild-type allele. Any
mismatch repair allele that produces such an effect can be used in this
invention. In addition, the use of over-expressed wild type MMR gene
alleles from human, mouse, plants, and yeast in bacteria has been shown to
cause a dominant negative effect on the bacterial host's MMR activity (9,
33, 34, 38).
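
The arithmetic of the truncation (a stop at codon 134 leaves the
N-terminal 133 amino acids) can be checked with a short sketch. The
sequence below is a placeholder open reading frame, not the hPMS2
sequence, and the function names and the three-entry codon table are
assumptions made only for this illustration.

```python
# Minimal codon table covering only the codons used in this toy example.
CODON_TABLE = {"ATG": "M", "GCT": "A", "TAA": "*"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def truncate_at_codon(cds: str, codon_number: int, stop: str = "TAA") -> str:
    """Return cds with the given 1-based codon replaced by a stop codon."""
    start = (codon_number - 1) * 3
    return cds[:start] + stop + cds[start + 3:]


# Placeholder 200-codon ORF (hypothetical): ATG followed by 199 GCT codons.
wild_type = "ATG" + "GCT" * 199
mutant = truncate_at_codon(wild_type, 134)

wild_type_length = len(translate(wild_type))  # all 200 codons are translated
mutant_length = len(translate(mutant))        # translation stops at codon 134
```

Because translation halts when codon 134 is read as a stop, the mutant
product in this sketch contains codons 1 through 133 only.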

Dominant negative alleles of a mismatch repair gene can be
obtained from the cells of humans, animals, yeast, bacteria, plants or other
organisms. Screening cells for defective mismatch repair activity can
identify such alleles. Mismatch repair genes may be mutant or wild type.
Bacterial host MMR may be mutated or not. The term bacteria as used in
this application includes any organism from the prokaryotic kingdom. These
organisms include genera such as but not limited to Agrobacterium,
Anaerobacter, Aquabacterium, Azorhizobium, Bacillus, Bradyrhizobium,
Cryobacterium, Escherichia, Enterococcus, Heliobacterium, Klebsiella,
Lactobacillus, Methanococcus, Methanothermobacter, Micrococcus,
Mycobacterium, Oceanomonas, Pseudomonas, Rhizobium,
Staphylococcus, Streptococcus, Streptomyces, Thermus aquaticus,
Thermaerobacter, Thermobacillus, etc. Other prokaryotes that can be used
for this application are listed at (www.bacterio.cict.fr/validgenericnames).
Bacteria exposed to chemical mutagens or radiation can be
screened for defective mismatch repair. Genomic DNA, cDNA, or mRNA
from any cell encoding a mismatch repair protein can be analyzed for
variations from the wild type sequence. Dominant negative alleles of a
mismatch repair gene can also be created artificially, for example, by
producing variants of the hPMS2-134 allele or other mismatch repair genes
(32). Various techniques of site-directed mutagenesis can be used. The
suitability of such alleles, whether natural or artificial, for use in generating
hypermutable bacteria can be evaluated by testing the mismatch repair
activity (using methods described in ref 32) caused by the allele in the
presence of one or more wild-type alleles, to determine if it is a dominant
negative allele.

A bacterium that over-expresses a wild type mismatch repair allele 
or a dominant negative allele of a mismatch repair gene will become 
hypermutable. This means that the spontaneous mutation rate of such 
bacteria is elevated compared to bacteria without such alleles. The degree 
of elevation of the spontaneous mutation rate can be at least 2-fold, 5-fold, 
10-fold, 20-fold, 50-fold, 100-fold, 200-fold, 500-fold, or 1000-fold that of 
the normal bacteria as measured as a function of bacterial doubling/minute. 
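
As an illustration of how a fold elevation might be estimated, the
sketch below compares the frequency of resistant colonies in a test
strain with that in a vector control. It uses mutation frequency
(resistant colonies per viable cell plated) as a simple stand-in for
mutation rate; the function names and the plate counts are hypothetical
and are not data from this application.

```python
def mutation_frequency(resistant_colonies: int, viable_cells: float) -> float:
    """Fraction of plated viable cells that gave rise to a resistant colony."""
    if viable_cells <= 0:
        raise ValueError("viable_cells must be positive")
    return resistant_colonies / viable_cells


def fold_elevation(test_frequency: float, control_frequency: float) -> float:
    """Ratio of the test strain's mutation frequency to the control's."""
    if control_frequency <= 0:
        raise ValueError("control_frequency must be positive to form a ratio")
    return test_frequency / control_frequency


# Hypothetical plate counts, chosen only to make the arithmetic easy to follow.
control = mutation_frequency(resistant_colonies=2, viable_cells=1e8)
test = mutation_frequency(resistant_colonies=400, viable_cells=1e8)
fold = fold_elevation(test, control)  # 400 / 2, i.e. roughly 200-fold
```

A control that yields no resistant colonies gives no finite ratio, which
is why the sketch raises an error in that case rather than returning a
number.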

According to one aspect of the invention, a polynucleotide encoding
either a wild type or a dominant negative form of a mismatch repair protein
is introduced into bacteria. The gene can be any dominant negative allele
encoding a protein which is part of a mismatch repair complex, for
example, mutS, mutL, mutH, or mutY homologs of the bacterial, yeast, plant
or mammalian genes (14, 28). The dominant negative allele can be
naturally occurring or made in the laboratory. The polynucleotide can be in
the form of genomic DNA, cDNA, RNA, or a chemically synthesized
polynucleotide or polypeptide. The molecule can be introduced into the
cell by transfection, transformation, conjugation, fusion, or other methods
well described in the literature.

Any process can be used whereby a polynucleotide or polypeptide
is introduced into a cell. The process of gene transfer can be carried out in
a bacterial culture using a suspension culture. The bacteria can be any type
classified under the prokaryotes.

In general, gene transfer will be carried out using a suspension of
cells but other methods can also be employed as long as a sufficient
fraction of the treated cells incorporate the polynucleotide or polypeptide
so as to allow recipient cells to be grown and utilized. The protein product
of the polynucleotide may be transiently or stably expressed in the cell.
Techniques for gene transfer are well known to those skilled in the art.
Available techniques to introduce a polynucleotide or polypeptide into a
prokaryote include but are not limited to electroporation, transduction, cell
fusion, the use of chemically competent cells (e.g. calcium chloride), and
packaging of the polynucleotide together with lipid for fusion with the cells
of interest. Once a cell has been transformed with the dominant negative
mismatch repair gene or protein, the cell can be propagated and
manipulated in either liquid culture or on a solid agar matrix, such as a petri
dish. If the transfected cell is stable, the gene will be retained and
expressed at a consistent level for many cell generations, either
constitutively when the promoter is constitutively active or, when the
promoter is inducible, in the presence of appropriate inducer molecules,
and a stable, hypermutable bacterial strain results.



An isolated bacterial cell is a clone obtained from a pool of a 
bacterial culture by chemically selecting out non-recipient strains using, for 
example, antibiotic selection of an expression vector. If the bacterial cell is 
derived from a single cell, it is defined as a clone. 

A polynucleotide encoding a dominant negative form of a mismatch
repair protein can be introduced into the genome of a bacterium or
propagated on an extra-chromosomal plasmid. Selection of clones
harboring the mismatch repair gene expression vector can be accomplished
by addition of any of several different antibiotics, including but not limited
to ampicillin, kanamycin, chloramphenicol, zeocin, and tetracycline. The
microbe can be any species for which suitable techniques are available to
produce transgenic microorganisms, such as but not limited to genera
including Bacillus, Pseudomonas, Staphylococcus, Escherichia and others.

Any method for making transgenic bacteria known in the art can be
used. According to one process of producing a transgenic microorganism,
the polynucleotide is transfected into the microbe by one of the methods
well known to those in the art. Next, the microbial culture is grown under
conditions that select for cells in which the polynucleotide encoding the
mismatch repair gene is either incorporated into the host genome as a stable
entity or propagated on a self-replicating extra-chromosomal plasmid, and
in which the polynucleotide is transcribed and
subsequently translated into a functional protein within the cell. Once a
transgenic microbe is engineered to harbor the expression construct, it is
then propagated to generate and sustain a culture of transgenic microbes
indefinitely.

Once a stable, transgenic microorganism has been engineered to
express a functional mismatch repair (MMR) protein, the microbe can be
exploited to create novel mutations in one or more target gene(s) of interest
harbored within the same microorganism. A gene of interest can be any
gene naturally possessed by the bacterium or one introduced into the
bacterial host by standard recombinant DNA techniques. The target
gene(s) may be known prior to the selection or unknown. One advantage
of employing such transgenic microbes to induce mutations in resident or
extra-chromosomal genes within the microbe is that it is unnecessary to
expose the microorganism to mutagenic insult, whether it be chemical or
radiation in nature, to produce a series of random gene alterations in the
target gene(s). This is due to the highly efficient nature and the spectrum
of naturally occurring mutations that result as a consequence of the altered
mismatch repair process. However, it is possible to increase the spectrum
and frequency of mutations by the concomitant use of chemicals
and/or radiation together with MMR defective cells. These include DNA
mutagens, DNA alkylating agents, DNA intercalating agents, DNA
oxidizing agents, ionizing radiation, and ultraviolet radiation. The net
effect of the combination treatment is an enlarged pool of altered genes in
the genetically altered microbe, which increases the likelihood of altering
an allele(s) useful for producing new output traits. Another benefit of
using the MMR-defective microbes taught in this application is that
one can perform a genetic screen for the direct selection of variant
sub-clones that exhibit new output traits with commercially important
applications. This allows one to bypass tedious and time consuming gene
identification, isolation and characterization.

Mutations can be detected by analyzing the recombinant microbe
for alterations in the genotype and/or phenotype after activation of the
decreased mismatch repair activity of the transgenic microorganism. Novel
genes that produce altered phenotypes in MMR-defective microbial cells
can be discerned by any of a variety of molecular techniques well known to
those in the art. For example, the microbial genome can be isolated and a
library of restriction fragments cloned into a plasmid vector. The library
can be introduced into a "normal" cell and the cells screened for the novel
phenotype. A plasmid is isolated from those normal cells that
exhibit the novel phenotype and the gene(s) characterized by DNA
sequence analysis. Alternatively, a differential messenger RNA screen can
be employed utilizing driver and tester RNA (derived from wild type and
novel mutant respectively) followed by cloning the differential transcripts
and characterizing them by standard molecular biology methods well
known to those skilled in the art. Furthermore, if the mutant sought is
encoded by an extrachromosomal plasmid, then following co-expression of
the dominant negative MMR gene and the gene of interest to be altered and
phenotypic selection, the plasmid is isolated from mutant clones and
analyzed by DNA sequence analysis by methods well known to those in the
art. Phenotypic screening for output traits in MMR-defective mutants can
be by biochemical activity and/or a physical phenotype of the altered gene
product. A mutant phenotype can also be detected by identifying
alterations in electrophoretic mobility, DNA binding in the case of
transcription factors, spectroscopic properties such as IR, CD, X-ray
crystallography or high field NMR analysis, or other physical or structural
characteristics of a protein encoded by a mutant gene. It is also possible to
screen for altered novel function of a protein in situ, in isolated form, or in
model systems. One can screen for alteration of any property of the
microorganism associated with the function of the gene of interest, whether
the gene is known prior to the selection or unknown. The aforementioned
screening and selection discussion is meant to illustrate the potential means
of obtaining novel mutants with commercially valuable output traits.

Plasmid expression vectors that harbor the mismatch repair (MMR) 
gene inserts can be used in combination with a number of commercially 
available regulatory sequences to control both the temporal and quantitative 
biochemical expression level of the dominant negative MMR protein. The 
regulatory sequences can be comprised of a promoter, enhancer or 
promoter/enhancer combination and can be inserted either upstream or 
downstream of the MMR gene to control the expression level. The 
regulatory promoter sequence can be any of those well known to those in 
the art, including but not limited to the lacI, tetracycline, 
tryptophan-inducible, phosphate inducible, T7-polymerase-inducible (30), 
and steroid inducible constructs as well as sequences which can result in 
the excision of the dominant negative mismatch repair gene such as those 
of the Cre-Lox system. These types of regulatory systems are familiar to 
those skilled in the art. 

Once a microorganism with a novel, desired output trait of interest 
is created, the aberrant MMR activity can be attenuated or 
eliminated by any of a variety of methods, including removal of the inducer 
from the culture medium that is responsible for promoter activation, gene 
disruption of the aberrant MMR gene constructs, electroporation and/or 
chemical curing of the expression plasmids (Brosius, Biotechnology 
10:205-225, 1988; Wang et al., J. of Fujian Agricultural University 
28:43-46, 1999; Fu et al., Chem Abstracts 34:415-418, 1988). The resulting 
microbe is now useful as a stable strain that can be applied to various 
commercial applications, depending upon the selection process placed upon 
it. 

In cases where genetically deficient mismatch repair bacteria 
[strains such as but not limited to: Ml (mutS) and in EC2416 (mutS delta 
umuDC), and mutL or mutY strains] are used to derive new output traits, 
transgenic constructs can be used that express wild-type mismatch repair 
genes sufficient to complement the genetic defect and therefore restore 
mismatch repair activity of the host after trait selection [Grzesiuk, E. et al. 
(Mutagenesis 13:127-132, 1998); Bridges, B.A., et al. (EMBO J. 16:3349- 
3356, 1997); LeClerc, J.E. (Science 15:1208-1211, 1996); Jaworski, A. 
et al. (Proc. Natl. Acad. Sci USA 92:11019-11023, 1995)]. The resulting 
microbe is genetically stable and can be applied to various commercial 
practices. 

Overexpression of foreign mismatch repair genes from human and 
yeast, such as PMS1, MSH2, MLH1, MLH3, etc., has been 
previously demonstrated to produce a dominant negative mutator 
phenotype in bacterial hosts (35, 36, 37). In addition, the use of bacterial 
strains expressing prokaryotic dominant negative MMR genes as well as 
hosts that have genomic defects in endogenous MMR proteins has also 
been previously shown to result in a dominant negative mutator phenotype 
(29, 32). However, the findings disclosed here teach the use of MMR 
genes, including the human PMSR2 and PMSR3 genes (ref 19), the related 
PMS134 truncated MMR gene (ref 32), the plant mismatch repair genes 
(SARAH include Plant patent application) and those genes that are 
homologous to the 134 N-terminal amino acids of the PMS2 gene, which 
include the MutL family of MMR proteins and including the PMSR and 
PMS2L homologs described by Hori et al. (accession number NM_005394 
and NM_005395) and Nicolaides (reference 19), to create hypermutable 
microbes. In addition, this application teaches the use of DNA mutagens in 
combination with MMR defective microbial hosts to enhance the 
hypermutable production of genetic alterations. This accentuates MMR 
activity for generation of microorganisms with commercially relevant 
output traits such as but not limited to recombinant protein production 
strains, biotransformation, and bioremediation. 

The above disclosure generally describes the present invention. A 
more complete understanding can be obtained by reference to the following 
specific examples that will be provided herein for purposes of illustration 
only, and are not intended to limit the scope of the invention. 

EXAMPLES 

Example 1: Generation of inducible MMR dominant negative allele 
vectors 

Bacterial expression constructs were prepared to determine if the human 
PMS2 related gene (hPMSR3) (19) and the human PMS134 gene (32) are 
capable of inactivating the bacterial MMR activity and thereby increase the 
overall frequency of genomic hypermutation, a consequence of which is the 
generation of variant sib cells with novel output traits following host 
selection. Moreover, the use of regulatable expression vectors will allow 
for suppression of dominant negative MMR alleles and restoration of the 
MMR pathway and genetic stability in host cells (43). For these studies, 
a plasmid encoding the hPMS134 cDNA was altered by polymerase chain 
reaction (PCR). The 5'-oligonucleotide has the following structure: 5'- 
ACG CAT ATG GAG CGA GCT GAG AGC TCG AGT-3' that includes 
the NdeI restriction site CAT ATG. The 3'-oligonucleotide has the 
following structure: 5'-GAA TTC TTA TCA CGT AGA ATC GAG ACC 
GAG GAG AGG GTT AGG GAT AGG CTT ACC AGT TCC AAC CTT 
CGC CGA TGC-3' that includes an EcoRI site GAA TTC and the 14 
amino acid epitope for the V5 antibody. The oligonucleotides were used 
for PCR under standard conditions that included 25 cycles of PCR (95°C 
for 1 minute, 55°C for 1 minute, 72°C for 1.5 minutes) followed by 
3 minutes at 72°C. The PCR fragment was purified by gel 
electrophoresis and cloned into pTA2.1 (Invitrogen) by standard cloning 
methods (Sambrook et al., Molecular Cloning: A Laboratory Manual, Third 
Edition, 2001), creating the plasmid pTA2.1-hPMS134. pTA2.1-hPMS134 
was digested with the restriction enzyme EcoRI to release the insert (there 
are two EcoRI restriction sites in the multiple cloning site of pTA2.1 that 
flank the insert) and the fragment filled in with Klenow fragment and 
dNTPs. Next, the fragment was gel purified, then digested with NdeI and 
inserted in pT7-Ea that had been digested with NdeI and BamHI (filled 
with Klenow) and phosphatase treated. The new plasmid was designated 
pT7-Ea-hPMS134. 

The following strategy, similar to that described above 
to clone human PMS134, was used to construct an expression vector for the 
human related gene PMSR3. First, the hPMSR3 fragment was amplified 
by PCR to introduce two restriction sites, an NdeI restriction site at the 5'- 
end and an EcoRI site at the 3'-end of the fragment. The 5'- 
oligonucleotide that was used for PCR has the following structure: 5'-ACG 
CAT ATG TGT CCT TGG CGG CCT AGA-3' that includes the NdeI 
restriction site CAT ATG. The 3'-oligonucleotide used for PCR has the 
following structure: 5'-GAA TTC TTA TTA CGT AGA ATC GAG ACC 
GAG GAG AGG GTT AGG GAT AGG CTT ACC CAT GTG TGA TGT 
TTC AGA GCT-3' that includes an EcoRI site GAA TTC and the V5 
epitope to allow for antibody detection. The plasmid that contained human 
PMSR3 in pBluescript SK (19) was used as the PCR target with the 
hPMS2-specific oligonucleotides above. Following 25 cycles of PCR 
(95°C for 1 minute, 55°C for 1 minute, 72°C for 1.5 minutes) 
and a final 3 minutes at 72°C, the PCR fragment was purified by gel 
electrophoresis and cloned into pTA2.1 (Invitrogen) by standard cloning 
methods (Sambrook et al., Molecular Cloning: A Laboratory Manual, Third 
Edition, 2001), creating the plasmid pTA2.1-hR3. pTA2.1-hR3 was next 
digested with the restriction enzyme EcoRI to release the insert (there are 
two EcoRI restriction sites in the multiple cloning site of pTA2.1 that flank 
the insert) and the fragment filled in with Klenow fragment and dNTPs. 
Then, the fragment was gel purified, then digested with NdeI and inserted 
in pT7-Ea that had been digested with NdeI and BamHI (filled with 
Klenow) and phosphatase treated. The new plasmid was designated pT7- 
Ea-hR3. 
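
The primer design described above can be checked with a short script. The following is a minimal sketch, not part of the disclosed method: it reverse-complements each 3'-oligonucleotide and translates the resulting coding strand so that the V5 epitope and the tandem stop codons can be read directly, and it translates each 5'-oligonucleotide from the ATG within the NdeI site. The helper names (clean, reverse_complement, translate) are hypothetical, and the V5 peptide sequence GKPIPNPLLGLDST is the commonly cited 14-residue tag, stated here as an assumption rather than taken from the text.

```python
# Illustrative primer check; helper names are hypothetical.
# Standard genetic code, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: commonly cited V5 tag sequence (not given in the text).
V5_EPITOPE = "GKPIPNPLLGLDST"

# Oligonucleotides exactly as listed in Example 1 (written 5' to 3').
PRIMERS_5 = {
    "hPMS134": "ACG CAT ATG GAG CGA GCT GAG AGC TCG AGT",
    "hPMSR3": "ACG CAT ATG TGT CCT TGG CGG CCT AGA",
}
PRIMERS_3 = {
    "hPMS134": "GAA TTC TTA TCA CGT AGA ATC GAG ACC GAG GAG AGG GTT AGG GAT AGG "
               "CTT ACC AGT TCC AAC CTT CGC CGA TGC",
    "hPMSR3": "GAA TTC TTA TTA CGT AGA ATC GAG ACC GAG GAG AGG GTT AGG GAT AGG "
              "CTT ACC CAT GTG TGA TGT TTC AGA GCT",
}


def clean(seq):
    """Remove spacing and normalise case."""
    return seq.replace(" ", "").upper()


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(clean(seq)))


def translate(seq):
    """Translate in frame 0; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


for name in PRIMERS_5:
    forward = clean(PRIMERS_5[name])
    start = forward.index("CATATG") + 3  # the ATG inside the NdeI site
    coding = translate(reverse_complement(PRIMERS_3[name]))
    print(name, "5' primer reading frame:", translate(forward[start:]))
    print(name, "3' primer, coding strand:", coding)
    print(name, "V5 epitope present:", V5_EPITOPE in coding)
```

Read on the coding strand, each 3'-oligonucleotide encodes gene-specific residues, then the epitope, then two stop codons, then the EcoRI site, which is the layout the text describes.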

BL21 cells harbor an additional expression vector for the lysozyme 
protein, which has been demonstrated to bind to the T7 polymerase in situ; 
this results in a bacterial strain that has very low levels of T7 polymerase 
expression. However, upon addition of the inducer IPTG, the cells express 
high levels of T7 polymerase due to the IPTG-inducible element that drives 
expression of the polymerase that is resident within the genome of the 
BL21 cells (30). The BL21 cells are chloramphenicol resistant due to the 
plasmid that expresses lysozyme within the cell. To introduce the pT7- 
hPMS134 or the pT7-hPMSR3 genes into BL21 cells, the cells were made 
competent by incubating the cells in ice cold 50 mM CaCl2 for 20 minutes, 
followed by concentrating the cells and adding super-coiled plasmid DNA 
as described (Maniatis, T. et al., Cold Spring Harbor Laboratory Press, 
Third Edition, 2001). Ampicillin resistant BL21 were selected on LB-agar 
plates [5% yeast extract, 10% bactotryptone, 5% NaCl, 1.5% bactoagar, 
pH 7.0 (Difco)] containing 25 µg/ml chloramphenicol and 100 µg/ml 
ampicillin. The next day, bacterial colonies were picked and analyzed for 
vectors containing an intact pTACPMS134 or pTAC empty vector by 
restriction endonuclease digestion and sequence analysis. 

In addition to constructing a V5-epitope tagged PMS134 construct 
we also constructed and tested a non-epitope tagged version. This was 
prepared to demonstrate that the simple fact of epitope tagging the 
construct did not result in alteration of the dominant-negative phenotype 
that PMS134 has on mismatch repair activity. For these studies, a BamHI 
restriction fragment containing the hPMS134 cDNA was filled in with 
Klenow fragment and then sub-cloned into a Klenow-filled blunt-ended 
NdeI-XhoI site of the pTACLAC expression vector, which contains the 
isopropylthio-β-galactoside (IPTG)-inducible bacterial TAC promoter 
and ampicillin resistance gene as selectable marker. The NdeI-XhoI 
cloning site is flanked by the TACLAC promoter that contains the LacI 
repressor site followed by a Shine Dalgarno ribosome-binding site at the 5' 
flanking region and the T1T2 ribosomal RNA terminator in the 3' flanking 
region. The TACLAC vector also contains the LacI gene, which is 
constitutively expressed by the TAC promoter. 

DH10B bacterial cells containing the pBCSK vector (Stratagene), 
which constitutively expresses the β-galactosidase gene and contains the 
chloramphenicol resistance marker for selection, were made competent via 
the CaCl2 method (Maniatis, T. et al., Cold Spring Harbor Laboratory Press, 
1982). This vector turns bacterial cells blue when grown in the presence of 
IPTG and X-gal, which aids in the detection of bacterial colonies. Competent 
cells were transfected with the pTAC empty vector or the pTACPMS134 
vector following the heat-shock protocol. Transfected cultures were plated 
onto LB-agar [5% yeast extract, 10% bactotryptone, 5% NaCl, 1.5% 
bactoagar, pH 7.0 (Difco)] plates containing 25 µg/ml chloramphenicol 
and 100 µg/ml ampicillin. The next day, bacterial colonies were picked 
and analyzed for vectors containing an intact pTACPMS134 or pTAC 
empty vector by restriction endonuclease digestion and sequence analysis. 
Ten clones of each bacteria containing correct empty or PMS134 inserts 
were then grown to confluence overnight in LB media (5% yeast extract, 
10% bactotryptone, 5% NaCl, pH 7.0) containing 10 µg/ml 
chloramphenicol and 50 µg/ml ampicillin. The next day pTAC empty or 
pTACPMS134 cultures were diluted 1:4 in LB medium plus 50 µM IPTG 
(Gold Biotechnology) and cultures were grown for 12 and 24 hours at 
37°C. After incubation, 50 µl aliquots were taken from each culture and 
added to 150 µl of 2X SDS buffer and cultures were analyzed for PMS134 
protein expression by western blot. 

Western blots were carried out as follows. 50 µl of each PMS134 
or empty vector culture was directly lysed in 2X lysis buffer (60 mM Tris, 
pH 6.8, 2% SDS, 10% glycerol, 0.1 M 2-mercaptoethanol, 0.001% 
bromophenol blue) and samples were boiled for 5 minutes. Lysate proteins 
were separated by electrophoresis on 4-20% Tris glycine gels (Novex). 
Gels were electroblotted onto Immobilon-P (Millipore) in 48 mM Tris 
base, 40 mM glycine, 0.0375% SDS, 20% methanol and blocked overnight 
at 4°C in Tris-buffered saline plus 0.05% Tween-20 and 5% condensed 
milk. Filters were probed with a rabbit polyclonal antibody generated 
against the N-terminus of the human PMS2 polypeptide (Santa Cruz), 
which is able to recognize the PMS134 polypeptide (31), followed by a 
secondary goat anti-rabbit horseradish peroxidase-conjugated antibody. 
After incubation with the secondary antibody, blots were developed using 
chemiluminescence (Pierce) and exposed to film to measure PMS134 
expression. 

As shown in Figure 1, a robust expression of PMS134 could be 
detected in bacterial cells containing pTACPMS134 (lane 2) in contrast to 
cells expressing empty vector (lane 1), which had no signal. 

For induction of PMS134 and PMSR3 in BL21 cells, the pT7-Ea- 
hPMS134 or the pT7-Ea-hPMSR3 cells were induced with 50 µM IPTG for 
12 and 24 hours. Cell lysates were prepared and analyzed by western blot 
as described above using either the N-terminal PMS2 antibody to detect the 
PMS134 containing cells or the anti-V5 horseradish peroxidase conjugated 
monoclonal antibody (Invitrogen) to detect the PMS134V5 and PMSR3V5 
polypeptides. Figure 2 shows the expression of PMS134V5 and PMSR3V5 
before (-) and after (+) IPTG induction. 

Example 2: Generation of hypermutable bacteria with inducible 
dominant negative alleles of mismatch repair genes 

Bacterial clones expressing the PMS134 or the empty vector were 
grown in liquid culture for 24 hr at 37°C in the presence of 10 µg/ml 
chloramphenicol and 50 µg/ml ampicillin plus 50 µM IPTG. The next day, 
cultures were diluted 1:10 in medium containing 50 µM IPTG plus 
ampicillin/chloramphenicol (AC) or ampicillin/chloramphenicol plus 25 
µg/ml kanamycin (ACK) and cultures were grown for 18 hr at 37°C. The 
following day, a 0.1 µl aliquot (2 µl diluted in 1000 µl of LB medium and 
used 50 µl for plating) of cells grown in AC medium was plated on LB- 
agar plates containing 40 µg/ml of 5-bromo-4-chloro-3-indolyl-β-D- 
galactoside (X-gal) plus 100 µg/ml ampicillin (AMP), while a 1 µl aliquot 
(1 µl diluted in 100 µl of LB medium and used 100 µl for plating) of cells 
grown in ACK medium was plated on LB-agar plates containing X-gal 
and 50 µg/ml kanamycin (KAN). Plates were incubated for 18 hours at 
37°C. The results from these studies show that cells expressing the 
PMS134 were able to increase hypermutation in the genome of the DH10B 
bacterial strain, which resulted in the production of siblings that exhibit new 
biological traits such as KAN resistance (Figure 3). 
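
The plated volumes quoted above follow from the dilution arithmetic. The sketch below is illustrative only; the function name plated_culture_volume is hypothetical, and it assumes the sample is added to the stated volume of LB (so the total volume is sample plus diluent), which the text does not state explicitly.

```python
def plated_culture_volume(sample_ul, diluent_ul, plated_ul):
    """Volume of undiluted culture (in µl) contained in the plated portion.

    Assumption: the sample is added to the diluent, so the total
    volume is sample_ul + diluent_ul.
    """
    total_ul = sample_ul + diluent_ul
    return sample_ul * plated_ul / total_ul


# AC cultures: 2 µl into 1000 µl of LB, 50 µl plated on X-gal/AMP plates
ac_equivalent = plated_culture_volume(2, 1000, 50)

# ACK cultures: 1 µl into 100 µl of LB, 100 µl plated on X-gal/KAN plates
ack_equivalent = plated_culture_volume(1, 100, 100)

print(f"AC plates:  {ac_equivalent:.3f} µl of culture")
print(f"ACK plates: {ack_equivalent:.3f} µl of culture")
print(f"KAN plates receive about {ack_equivalent / ac_equivalent:.1f}x more culture")
```

Under this assumption the two platings correspond to approximately 0.1 µl and 1 µl of culture, so roughly ten times more cells are spread on the kanamycin plates than on the ampicillin plates.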

Kanamycin-resistance assays using BL21 cells expressing the 
V5-tagged or untagged PMS134 or PMSR3 polypeptides were 
carried out as described above. BL21 bacterial cells that harbor the 
empty vector, pT7-PMS134 or pT7-PMSR3 were grown overnight in 
LB supplemented with 100 µg/ml ampicillin. The overnight cultures 
were diluted 1:100 into fresh ampicillin containing medium and 
grown for 2.5 hours at 37°C with continuous shaking. When the cells 
reached an optical density (OD) of 0.6, measured at 600 nm, IPTG 
was added to each culture to a final concentration of 0.5 mM. Cells 
were incubated for 24 and 48 hours; at those time points cells were 
removed for SDS-PAGE analysis and plating (see above). 
BL21/pT7 (empty vector), BL21/pT7-PMS134, and BL21/pT7-R3 
cells were plated onto LB plates, LB plates that contained 100 µg/ml 
ampicillin, and plates that contained 50 µg/ml kanamycin. The 
equivalent of 1 x 10^7 cells/plate were spread onto the plates. BL21 
cells that harbor the empty vector are capable of growth on LB plates 
as well as LB plates that contain 100 µg/ml ampicillin; that is as 
expected since the pT7 expression vector renders the cells ampicillin 
resistant. The vector-only control is not capable of growth on 
kanamycin. After 24 hr of IPTG induction, PMS134 or PMSR3 cells 
had a significant number of KAN resistant cells while none were 
observed in BL21 parental cells grown under similar conditions 
(Figure 4). Moreover, BL21 cells containing the PMS134 or PMSR3 
genes under non-IPTG-induced conditions failed to produce any 
KAN resistant clones, demonstrating the need for expression of the 
PMS polypeptides for hypermutability. A summary outlining the 
data and number of kanamycin resistant bacterial clones is provided in 
TABLE 1. 



TABLE 1. Generation of Kanamycin resistant clones via MMR deficiency 

STRAIN                  # CELLS SEEDED   AMP R colonies   KAN R colonies   FREQUENCY 
DH10B VEC               50,000           62,000           0                0 
DH10B PMS134            50,000           43,146           23               5.3 x 10^-4 
BL21 VEC                500,000          520,800          0                0 
BL21 T7-Ea-PMS134V5     500,000          450,000          2,245            4.9 x 10^-3 
BL21 T7-Ea-PMSR3V5      500,000          500,000          1,535            3.1 x 10^-3 



These data provide proof-of-concept that the use of 
dominant negative MMR genes is a viable approach to creating 
hypermutable bacteria that can lead to the generation of phenotypically 
diverse offspring when put under selective conditions. 
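
The FREQUENCY column of TABLE 1 is consistent with the number of KAN R colonies divided by the number of AMP R colonies. The following minimal sketch recomputes it under that assumption (the text does not state the formula explicitly); the names TABLE_1 and kan_frequency are illustrative.

```python
# (AMP R colonies, KAN R colonies) as listed in TABLE 1
TABLE_1 = {
    "DH10B VEC": (62_000, 0),
    "DH10B PMS134": (43_146, 23),
    "BL21 VEC": (520_800, 0),
    "BL21 T7-Ea-PMS134V5": (450_000, 2_245),
    "BL21 T7-Ea-PMSR3V5": (500_000, 1_535),
}


def kan_frequency(amp_r_colonies, kan_r_colonies):
    """Assumed definition: KAN-resistant clones per viable (AMP R) colony."""
    return kan_r_colonies / amp_r_colonies


for strain, (amp_r, kan_r) in TABLE_1.items():
    print(f"{strain:22s} {kan_frequency(amp_r, kan_r):.2e}")
```

The computed values agree with the tabulated frequencies to within rounding, and the empty-vector controls give a frequency of zero because no KAN R colonies were recovered.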

Using the same protocol as listed above and the same cloning 
strategy, a truncated PMS2 homolog from the Arabidopsis thaliana 
plant (reference the PLANT patent), which was cloned by 
degenerate PCR from an Arabidopsis thaliana cDNA library 
(Stratagene), was found to give a similar enhancement of genetic 
hypermutability in DH5alpha bacteria (Figure 5). For detection 
purposes, we fused a FLAG epitope to the C-terminus of the PMS134 
polypeptide using PCR and an antisense primer directed to the 134 
codon region of the Arabidopsis PMS2 homolog followed by a 
FLAG epitope and 2 termination codons. The resultant fusion was 
termed ATPMS134-flag. The ATPMS134-flag gene was then 
cloned into the IPTG-inducible TACLAC expression vector and 
transfected into DH5alpha cells. Western blot of bacteria transfected 
with an IPTG-inducible expression vector carrying a truncated 
version (codons 1-134) of the Arabidopsis thaliana PMS2 homolog 
using the anti-FLAG antibody demonstrated the inducibility and 
steady-state protein levels of the chimeric gene. Figure 5A shows 
the western blot containing protein from an untransfected cell (lane 
1) and a bacterial clone expressing the Arabidopsis PMS2-134 
truncated protein (lane 2). Following the mutagenesis protocol 
described above, bacterial cells expressing the ATPMS134 protein 
were found to have an increase in the number of KAN resistant cells 
(12 clones) in contrast to cells expressing the empty vector, which 
yielded no KAN resistant clones. 

Bacterial cells such as the pT7-PMS134 and pT7-R3 
harboring BL21 cells; the TACLACPMS134 DH10B cells; the 
TACLACMLH1 DH10B cells; or the TACLAC-ATPMS134flag 
DH5alpha cells are capable of growth on LB, LB/ampicillin and 
LB/KAN plates because the cells have acquired mutations within 
their genome that render the cell drug resistant. Cells that express 
dominant negative MMR genes have altered the mismatch control 
pathway of the microbe, presumably altering a gene or a set of genes 
that control resistance to kanamycin. A new output trait, kanamycin 
resistance, is generated by expression of the dominant negative MMR 
gene in these cells. These data demonstrate the ability of dominant 
negative MMR genes to produce hypermutability across a wide array 
of bacterial strains to produce new output traits such as kanamycin 
resistance. 

EXAMPLE 3: Dominant negative MMR genes can produce new 
genetic variants and commercially viable output traits in 
prokaryotic organisms. 

The data presented in EXAMPLE 2 show the ability to 
generate genetic alterations and new phenotypes in bacterial strains 
expressing dominant negative MMR genes. In this EXAMPLE we 
teach the utility of this method to create prokaryotic strains with 
commercially relevant output traits. 

GENERATION OF HEAT-RESISTANT PRODUCER 
STRAINS. 

One example of commercial utility is the generation of heat- 
resistant recombinant protein producer strains. In the scalable 
process of recombinant manufacturing, large-scale fermentation of 
prokaryotes results in the generation of heat, which leads to 
suboptimal growth conditions for the producer strain and thus 
results in lower recombinant protein yields. In order to circumvent 
this problem, we employed DH10B bacteria containing the 
inducible TACLACPMS134 gene. Briefly, cells were grown in 5 ml 
LB shake flasks containing ampicillin and IPTG-induced for 0, 24 
and 48 hrs at 37°C. Cultures were harvested and then incubated at 
100°C for 0, 1 or 10 minutes (times at which 100% of the wild-type 
strain perishes) and 100 µl aliquots (equivalent to 250,000 cells) were 
plated onto LB agar plates containing ampicillin to identify heat 
resistant clones. Table 2 shows a typical experiment whereby cells 
containing the TACLACPMS134 gene generated a significant 
number of heat-resistant clones after 48 hours of PMS134 induction 
and hypermutation via MMR blockade. No clones or only a few clones 
were observed in the uninduced or 24 hr induced conditions, respectively, 
suggesting the need for multiple rounds of genetic mutation to 
produce genes that are capable of allowing bacteria to survive under 
harsh conditions. Similar results were observed with other dominant 
negative mutants such as the PMSR2, PMSR3, and the human MLH1 
proteins (not shown). 



TABLE 2. Generation of heat-resistant clones via MMR deficiency 

Treatment                  Heated 0 min          Heated 1 min   Heated 10 min 
TACLACVEC 0 hr IPTG        250,000 +/- 7,500     0              0 
TACLACPMS134 0 hr IPTG     265,000 +/- 2,000     0              0 
TACLACVEC 24 hr IPTG       274,000 +/- 12,000    1 +/- 0        0 
TACLACPMS134 24 hr IPTG    240,000 +/- 9,400     5 +/- 2        0 
TACLACVEC 48 hr IPTG       256,000 +/- 12,000    0              0 
TACLACPMS134 48 hr IPTG    252,000 +/- 14,000    65 +/- 8       3 +/- 1 







GENERATION OF HIGH RECOMBINANT PROTEIN 
PRODUCER STRAINS. 

Next, we tested the ability of bacteria expressing dominant 
negative MMR genes to produce subclones with enhanced 
recombinant protein production. In these experiments again we 
employed the DH10B cells containing the TACLACPMS134 
inducible vector plus the pTLACZ vector, which constitutively 
expresses the β-galactosidase gene. Individual clones 
containing the TACLACPMS134 and pTLACZ vectors typically 
produce 10-20 µg/ml of LACZ protein via shake flask fermentation 
after IPTG induction for 24 hours. To test the hypothesis that high 
recombinant producer strains can be generated by decreased MMR in 
bacterial strains, we induced the TACLACPMS134-pTLACZ cells 
for 48 hours with IPTG as described above. We then diluted the 
culture 1:50 in LB medium, grew the strain for 24 hours, and plated 
10 µl of culture (diluted in 300 µl of LB) onto LB amp-XGAL 
plates to identify candidate clones that produce robust levels of 
recombinant LACZ protein. As a control, uninduced cells were 
treated similarly and plated onto LB amp-XGAL plates. Analysis of 
the plates revealed a number of clones with an intense BLUE 
staining in the TACLACPMS134/pTLACZ cells induced with IPTG, 
but none were observed in uninduced clones (Figure 6). To confirm 
that these clones produced an enhanced level of LACZ, we expanded 
2 clones with an average BLUE stain (BGAL-C1 and BGAL-C2) and 10 
clones with a robust BLUE staining (BGAL-MOR1 to BGAL- 
MOR10). We grew all clones in LB AMP for 24 hours without IPTG 
and replated the clones. Six out of ten BGAL-MOR clones resulted 
in a more robust β-gal stain in situ as compared to control "average" 
cells (BGAL-C1 and BGAL-C2). We next performed a more quantitative 
assay using a β-gal ELISA. Briefly, 2 ml of cells were centrifuged 
at 10,000 x g for 10 minutes and resuspended in 0.5 ml of 0.25 M Tris, 
pH 7.5 plus 0.0001% Tween-20. Cells were freeze-thawed 4 times and 
vortexed for 4 minutes at room temperature. Lysates were cleared of 
debris by centrifugation and supernatants were collected. Protein 
extracts were quantified for total protein using the Bradford assay 
(BioRad) as described by the manufacturer. Plate ELISAs were 
carried out by coating 96 well maxisorb (NUNC) plates with 0.1 ml 
of a 1 ng/ml (diluted in PBS pH 7.0) bacterial extract solution and a 
dose range of recombinant β-GAL (Sigma) from 0.001 to 10 mg/ml. 
All samples were plated in triplicate. Plates were coated for 2 hours, 
washed 2 times with PBS and blocked with 0.2 ml of PBS plus 5% 
powdered milk for 30 minutes. Next, plates were washed once with 
PBS and incubated with an anti-β-galactosidase monoclonal antibody 
that recognizes both native and denatured forms (Sigma) for 2 hours. 
Plates were then washed 3 times with PBS and incubated with 0.1 
ml of an anti-mouse horseradish peroxidase conjugated antibody for 
1 hour at room temperature. Plates were washed 3 times with PBS 
and incubated with TMB ELISA substrate (BioRad) for 15 to 30 
minutes. Reactions were stopped with 0.1 N H2SO4 and read on a 
BioRad plate reader at 415 nm. The control clones produced 
roughly 9 and 13 µg/ml of β-gal while BGAL-MOR clones 2, 3 and 
9 produced 106, 82 and 143 µg/ml of β-gal. To determine whether the 
reason that these clones produced more β-gal was mutations in the 
plasmid promoter elements, we isolated the pTLACZ plasmid and 
retransfected it into DH10B cells as described above. In situ analysis 
found the resultant clones to produce a similar amount of β-gal as that 
of the control. These data suggest that the BGAL-MOR 2, 3, and 9 
hosts had alterations that result in elevated expression and/or 
stability of recombinant proteins. 

To determine if the enhanced in situ β-gal expression that was 
observed in BGAL-MOR clones 1, 5, and 6, which did not appear to 
have enhanced β-gal protein levels (had less than 15 µg/ml as 
determined by ELISA), was authentic, we performed a more 
quantitative assay on these lines plus the BGAL-MOR 9, the 
BGAL-C1 and BGAL-C2 lines as control. Cells containing an empty vector 
(without a LACZ gene) were used as negative control. To measure 
β-gal activity, we employed a colorimetric β-gal substrate assay 
using CPRG (Roche) as described (31). Briefly, 5 µg of protein 
extract isolated for ELISA analysis (described above) were analyzed 
using a plate assay. Protein was added to buffer containing 45 mM 
2-mercaptoethanol, 1 mM MgCl2, 0.1 M NaPO4 and 0.6 mg/ml 
chlorophenol red-β-D-galactopyranoside (CPRG, Roche). Reactions 
were incubated for 1 hour, terminated by the addition of 0.5 M 
Na2CO3, and analyzed by spectrophotometry at 576 nm in a BioRad 
plate reader. Analysis of these extracts confirmed our in situ data 
that these cells did have increased β-gal activity (TABLE 3). 



TABLE 3. Generation of bacterial clones with increased β-gal enzymatic activity via 
MMR deficiency. 

Clone           β-gal protein (µg/ml)   β-gal activity (O.D. 576) 
BGAL-C1         9                       0.413 +/- .092 
BGAL-C2         13                      0.393 +/- .105 
BGAL-MOR1       14                      0.899 +/- .134 
BGAL-MOR5       13                      0.952 +/- .133 
BGAL-MOR6       15                      0.923 +/- .100 
BGAL-MOR9       143                     0.987 +/- .106 
Empty vector                            0.132 +/- .036 

Because there was no observable increase in the amount of β-gal 
protein, one likely hypothesis is that the β-gal gene structure was 
mutated during the hypermutability growth stage and now produces a 
more active enzyme. Sequence analysis confirms that this may be 
the reason for enhanced activity in a subset of clones. 
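
The comparison behind this hypothesis can be made explicit by expressing both columns of TABLE 3 relative to the two control clones. The sketch below is illustrative: subtracting the empty-vector reading as assay background is an assumption not stated in the text, and the names TABLE_3, EMPTY_VECTOR_OD and fold_changes are hypothetical.

```python
# clone: (β-gal protein by ELISA in µg/ml, mean O.D. 576), from TABLE 3
TABLE_3 = {
    "BGAL-C1": (9, 0.413),
    "BGAL-C2": (13, 0.393),
    "BGAL-MOR1": (14, 0.899),
    "BGAL-MOR5": (13, 0.952),
    "BGAL-MOR6": (15, 0.923),
    "BGAL-MOR9": (143, 0.987),
}
EMPTY_VECTOR_OD = 0.132  # assumption: treated here as assay background
CONTROLS = ("BGAL-C1", "BGAL-C2")

control_protein = sum(TABLE_3[c][0] for c in CONTROLS) / len(CONTROLS)
control_activity = (
    sum(TABLE_3[c][1] for c in CONTROLS) / len(CONTROLS) - EMPTY_VECTOR_OD
)


def fold_changes(clone):
    """Return (protein fold, background-subtracted activity fold) vs. controls."""
    protein, od = TABLE_3[clone]
    return protein / control_protein, (od - EMPTY_VECTOR_OD) / control_activity


for clone in TABLE_3:
    protein_fold, activity_fold = fold_changes(clone)
    print(f"{clone:10s} protein x{protein_fold:5.2f}   activity x{activity_fold:4.2f}")
```

For BGAL-MOR1, 5 and 6 the protein level stays close to that of the controls while the background-subtracted activity is roughly threefold higher, which is the pattern the hypothesis of a more active enzyme is meant to explain.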

Together, these data demonstrate the ability to produce 
genetically altered prokaryotic host strains using dominant negative 
MMR genes to generate commercially valuable output traits such as 
high recombinant protein producer lines and structurally altered 
enzymes with enhanced activities. 

EXAMPLE 4: Mutations in the host genome generated by 
defective MMR are genetically stable 

As described in EXAMPLES 2 and 3, manipulation of the 
MMR pathway in microbes results in alterations within the host 
genome and the ability to select for novel output traits. It is 
important that the mutations introduced as a result of defective MMR 
are genetically stable and passed on to daughter cells once a desired 
output pathway is established. To determine the genetic stability of 
mutations introduced into the microbial genome, the following 
experiment was performed. Five independent colonies from pT7- 
PMS134 and pT7-PMSR3 that are kanamycin resistant were grown 
overnight from an isolated colony in 5 ml of LB. Next, 1 µL of the 
overnight culture from each of these cultures was inoculated into another 
5 mL of LB and grown overnight to saturation. Under these growth 
conditions the microbial cells have divided over 20 generations. 
Therefore, if the new output trait generated by alteration of MMR is 
unstable, the cells should "revert" back from kanamycin resistance to 
kanamycin sensitivity. Cells were plated onto LB plates and 
incubated overnight at 37°C. Next, the colonies (about 1,000/plate) 
were replica plated to LB, LB-amp100, and LB-kan50 plates and incubated 
at 37°C overnight. Analysis of clones from these studies reveals that a 
strict correlation occurs between loss of dominant negative MMR 
expression and phenotype stability. No loss of KAN resistant clones 
generated in Example 3 was observed when cells were grown in the 
absence of IPTG (not expressing PMS134), while 5 revertants out of 
1200 were observed in clones that were continually grown in IPTG 
(expressing PMS134). Extended culturing of cells and replica plating 
found no reversions of KAN resistance in cultures grown in the 
absence of IPTG, which produce no PMS134 as determined by 
western blot (data not shown). 
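
The numbers in this stability experiment can be put on a common footing with two small calculations. The sketch below is illustrative only: it assumes the subculture regrows to the same density as the inoculating culture, so that the number of doublings equals log2 of the dilution factor, and the function name doublings_after_dilution is hypothetical.

```python
import math


def doublings_after_dilution(inoculum_ul, culture_ul):
    """Doublings needed to regrow to the starting density.

    Assumption: the subculture reaches the same saturation density
    as the culture it was inoculated from.
    """
    return math.log2(culture_ul / inoculum_ul)


# 1 µL of overnight culture inoculated into 5 mL (5000 µL) of LB
subculture_doublings = doublings_after_dilution(1, 5000)

# 5 revertants observed among 1200 colonies grown continually in IPTG
reversion_fraction = 5 / 1200

print(f"Doublings in the subculture step: {subculture_doublings:.1f}")
print(f"Reversion fraction with IPTG: {reversion_fraction:.2%}")
```

Under this assumption the 1:5000 subculture alone accounts for roughly 12 doublings; growth of the original colony and of the first overnight culture adds to that figure. The revertants seen under continued IPTG induction amount to well under 1% of the colonies screened.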

These data demonstrate the utility of employing inducible 
expression systems and dominant negative MMR genes in 
prokaryotes to generate genetically altered strains for commercial 
applications such as but not limited to enhanced recombinant 
manufacturing and biotransformation that can then in turn be restored 
to a genetically stable host with a "fixed" new genotype that is 
suitable for commercial processes. 

EXAMPLE 5: Enhanced Generation of MMR-Defective Bacteria 
and Chemical Mutagens for the Generation of New Output 
Traits 

It has been previously documented that MMR deficiency leads to
increased mutation frequency and increased resistance to the toxic effects
of chemical mutagens (CM) and their respective analogues, such as but not
limited to: ethidium bromide, EMS, MNNG, MNU, Tamoxifen,
8-Hydroxyguanine, as well as others listed in, but not limited to,
publications by: Khromov-Borisov, N.N., et al. (Mutat. Res. 430:55-74,
1999); Ohe, T., et al. (Mutat. Res. 429:189-199, 1999); Hour, T.C. et al.
(Food Chem. Toxicol. 37:569-579, 1999); Hrelia, P., et al. (Chem. Biol.
Interact. 118:99-111, 1999); Garganta, F., et al. (Environ. Mol. Mutagen.
33:75-85, 1999); Ukawa-Ishikawa S., et al. (Mutat. Res. 412:99-107, 1998);
www.ehs.utah.edu/ohh/mutagens, etc. To demonstrate the ability of CMs
to increase the mutation frequency in MMR defective bacterial cells, we
exposed T7-PMS134 BL21 cells to CMs.
T7-PMS134 cells and empty vector control cells were grown with
IPTG for 48 hours and then diluted 1:50 in LB plus IPTG and increasing
amounts of ethyl methane sulfonate (EMS): 0, 1, 10, 50, 100, and 200
µM. 10 µL aliquots of culture (diluted in 300 µL LB) were plated out on
LB agar plus ampicillin plates and grown overnight at 37°C. The next day
plates were analyzed for cell viability as determined by colony formation.
Analysis found that while no significant difference in colony number was
observed between the pT7-PMS134 and control cells at the 0, 1, or 10 µM
concentrations (all had > 1000 colonies), the number of control colonies
was reduced to 30 and 0 at the 50 and 100 µM concentrations,
respectively. No difference was observed in the pT7-PMS134 cells treated
with 0, 1, 10 or 50 µM, while a 3-fold reduction was observed in cultures
treated with 100 µM EMS. The 200 µM treatment was toxic for both lines.
These data demonstrate the ability of MMR deficiency to protect
prokaryotes against the toxic effects of DNA alkylating agents and provide
a means to generate a wider range of mutations that can lead to an
increased number of genetic variations and an increase in the number of
new biochemical activities within host proteins to produce new output
traits for commercial applications.
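
The EMS survival comparison above can be tabulated as in the following minimal sketch. It rests on stated assumptions: counts reported as "> 1000" are treated as a lower bound of 1000, "no difference" is encoded as a relative survival of 1.0, and the "3 fold reduction" as 1/3. The names are illustrative, and the 200 µM dose is left out because no counts are given for it.

```python
# Control colony counts as reported in the text. The untreated control count
# is given only as "> 1000", so 1000 is used as a lower bound (assumption).
CONTROL_UNTREATED_LOWER_BOUND = 1000
CONTROL_COUNTS = {50: 30, 100: 0}  # EMS dose in uM -> colonies

# pT7-PMS134 survival relative to untreated, read from the text:
# "no difference" is encoded as 1.0 and "3 fold reduction" as 1/3 (assumption).
PMS134_RELATIVE_SURVIVAL = {50: 1.0, 100: 1.0 / 3.0}


def survival_upper_bound(colonies: int, untreated_lower_bound: int) -> float:
    """Upper bound on the surviving fraction when the untreated count
    is itself only known as a lower bound."""
    if untreated_lower_bound <= 0:
        raise ValueError("untreated_lower_bound must be positive")
    return colonies / untreated_lower_bound


for dose in sorted(CONTROL_COUNTS):
    control = survival_upper_bound(CONTROL_COUNTS[dose], CONTROL_UNTREATED_LOWER_BOUND)
    pms134 = PMS134_RELATIVE_SURVIVAL[dose]
    print(f"{dose} uM EMS: control <= {control:.3f}, pT7-PMS134 ~ {pms134:.2f}")
```

Because the untreated counts are lower bounds, the control values printed here are upper limits on survival rather than measured fractions.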

To confirm that MMR deficient bacterial cells treated with CM can
result in an increased mutation rate and produce a greater number of
variants, we cultured pT7-PMS134 cells and empty vector controls in the
presence of IPTG for 48 hours, followed by dilution and regrowth in 25 µM
EMS for 24 hours as described above. Cells were plated out on 100 mm
petri dishes containing ampicillin or KAN and scored for KAN resistance.
Analysis revealed an 11-fold increase in the generation of KAN
resistant cells in pT7-Ea-PMS134V5 cells in contrast to control
cells.

These data demonstrate the use of a regulated dominant
negative MMR system plus chemical mutagens to produce enhanced
numbers of genetically altered prokaryotic strains that can be selected for
new output traits. This method is useful for generating such organisms
for commercial applications such as but not limited to recombinant
manufacturing, biotransformation, and altered biochemicals
(biotransformation) with enhanced activities for manufacturing purposes
and gene discovery for pharmaceutical compound development.

EXAMPLE 6: Alternative Methods for Inhibition of Bacterial
MMR Activity

The inhibition of MMR activity in a host organism can be achieved
by introducing a dominant negative allele as shown in EXAMPLES 2 and 3.
This application also teaches the use of regulated systems to
control MMR in prokaryotes to generate genetic diversity and output traits
for commercial applications. Other ways to regulate the suppression of
MMR activity of a host include: 1) using genetic recombination to knock
out alleles of a MMR gene that can be spliced out after selection using a
system such as the CRE-Lox system; 2) blocking MMR protein
dimerization with other subunits (which is required for activity) by the
introduction of polypeptides or antibodies into the host via transfection
methods routinely used by those skilled in the art; or 3) decreasing the
expression of a MMR gene using anti-sense oligonucleotides.

MMR gene knockouts. We intend to generate targeting vectors that disrupt
a particular MMR gene and introduce them into the genome of bacteria using
methods standard in the art. Bacteria exhibiting hypermutability will be
useful to produce genetically diverse offspring for commercial applications.
Bacteria will be confirmed to have lost the expression of the MMR gene
using standard northern and biochemical techniques (as described in
reference 32). MMR gene loci can be knocked out, strains selected for new
output traits, and MMR restored by introducing a wild-type MMR gene to
complement the KO locus. Other strategies include using KO vectors that
can target a MMR gene locus, select for host output traits and then have the
KO vector "spliced" from the genome after strain generation. This process
could be performed using systems such as but not limited to CRE-Lox.

Blocking peptides. MMR subunits (MutS and MutL proteins) interact to
form active MMR complexes. Peptides are able to specifically inhibit the
binding of two proteins by competitive inhibition. Peptides or
antibodies to conserved domains of a particular MMR gene product can be
introduced into prokaryotic cells using lipid transfer methods that are
standard in the art. Bacteria will be confirmed to have lost the expression
of the MMR gene using standard northern and biochemical techniques (as
described in reference 32). Bacteria exhibiting hypermutability will be
useful to produce genetically diverse sibs for commercial applications.

Discussion

The results described above lead to several conclusions. The
expression of dominant negative MMR proteins results in an increase in
hypermutability in bacteria. This activity is due to the inhibition of MMR
biochemical activity in these hosts. This method provides a claim for use of
dominant negative MMR genes and their encoded products for the creation
of hypermutable bacteria to produce new output traits for commercial
applications.

EXAMPLES OF MMR GENES AND ENCODED POLYPEPTIDES



Yeast MLH1 cDNA (accession number U07187) 

1 aaataggaat gtgatacctt ctattgcatg caaagatagt gtaggaggcg ctgctattgc 
61 caaagacttt tgagaccgct tgctgtttca ttatagttga ggagttctcg aagacgagaa 
121 attagcagtt ttcggtgttt agtaatcgcg ctagcatgct aggacaattt aactgcaaaa 
181 ttttgatacg atagtgatag taaatggaag gtaaaaataa catagaccta tcaataagca 
241 atgtctctca gaataaaagc acttgatgca tcagtggtta acaaaattgc tgcaggtgag 
301 atcataatat cccccgtaaa tgctctcaaa gaaatgatgg agaattccat cgatgcgaat 
361 gctacaatga ttgatattct agtcaaggaa ggaggaatta aggtacttca aataacagat 
421 aacggatctg gaattaataa agcagacctg ccaatcttat gtgagcgatt cacgacgtcc 
481 aaattacaaa aattcgaaga tttgagtcag attcaaacgt atggattccg aggagaagct 
541 ttagccagta tctcacatgt ggcaagagtc acagtaacga caaaagttaa agaagacaga 
601 tgtgcatgga gagtttcata tgcagaaggt aagatgttgg aaagccccaa acctgttgct 
661 ggaaaagacg gtaccacgat cctagttgaa gacctttttt tcaatattcc ttctagatta 
721 agggccttga ggtcccataa tgatgaatac tctaaaatat tagatgttgt cgggcgatac 
781 gccattcatt ccaaggacat tggcttttct tgtaaaaagt tcggagactc taattattct 
841 ttatcagtta aaccttcata tacagtccag gataggatta ggactgtgtt caataaatct 
901 gtggcttcga atttaattac ttttcatatc agcaaagtag aagatttaaa cctggaaagc 
961 gttgatggaa aggtgtgtaa tttgaatttc atatccaaaa agtccatttc attaattttt 
1021 ttcattaata atagactagt gacatgtgat cttctaagaa gagctttgaa cagcgtttac 
1081 tccaattatc tgccaaaggg cttcagacct tttatttatt tgggaattgt tatagatccg 
1141 gcggctgttg atgttaacgt tcacccgaca aagagagagg ttcgtttcct gagccaagat 
1201 gagatcatag agaaaatcgc caatcaattg cacgccgaat tatctgccat tgatacttca 
1261 cgtactttca aggcttcttc aatttcaaca aacaagccag agtcattgat accatttaat 
1321 gacaccatag aaagtgatag gaataggaag agtctccgac aagcccaagt ggtagagaat 
1381 tcatatacga cagccaatag tcaactaagg aaagcgaaaa gacaagagaa taaactagtc 
1441 agaatagatg cttcacaagc taaaattacg tcatttttat cctcaagtca acagttcaac 
1501 tttgaaggat cgtctacaaa gcgacaactg agtgaaccca aggtaacaaa tgtaagccac 
1561 tcccaagagg cagaaaagct gacactaaat gaaagcgaac aaccgcgtga tgccaataca 
1621 atcaatgata atgacttgaa ggatcaacct aagaagaaac aaaagttggg ggattataaa 
1681 gttccaagca ttgccgatga cgaaaagaat gcactcccga tttcaaaaga cgggtatatt 
1741 agagtaccta aggagcgagt taatgttaat cttacgagta tcaagaaatt gcgtgaaaaa 
1801 gtagatgatt cgatacatcg agaactaaca gacatttttg caaatttgaa ttacgttggg 
1861 gttgtagatg aggaaagaag attagccgct attcagcatg acttaaagct ttttttaata 
1921 gattacggat ctgtgtgcta tgagctattc tatcagattg gtttgacaga cttcgcaaac 
1981 tttggtaaga taaacctaca gagtacaaat gtgtcagatg atatagtttt gtataatctc 
2041 ctatcagaat ttgacgagtt aaatgacgat gcttccaaag aaaaaataat tagtaaaata 
2101 tgggacatga gcagtatgct aaatgagtac tattccatag aattggtgaa tgatggtcta 
2161 gataatgact taaagtctgt gaagctaaaa tctctaccac tacttttaaa aggctacatt 
2221 ccatctctgg tcaagttacc attttttata tatcgcctgg gtaaagaagt tgattgggag 
2281 gatgaacaag agtgtctaga tggtatttta agagagattg cattactcta tatacctgat 
2341 atggttccga aagtcgatac actcgatgca tcgttgtcag aagacgaaaa agcccagttt 
2401 ataaatagaa aggaacacat atcctcatta ctagaacacg ttctcttccc ttgtatcaaa 
2461 cgaaggttcc tggcccctag acacattctc aaggatgtcg tggaaatagc caaccttcca 
2521 gatctataca aagtttttga gaggtgttaa ctttaaaacg ttttggctgt aataccaaag 
2581 tttttgttta tttcctgagt gtgattgtgt ttcatttgaa agtgtatgcc ctttccttta 
2641 acgattcatc cgcgagattt caaaggatat gaaatatggt tgcagttagg aaagtatgtc 
2701 agaaatgtat attcggattg aaactcttct aatagttctg aagtcacttg gttccgtatt 
2761 gttttcgtcc tcttcctcaa gcaacgattc ttgtctaagc ttattcaacg gtaccaaaga 
2821 cccgagtcct tttatgagag aaaacatttc atcatttttc aactcaatta tcttaatatc 
2881 attttgtagt attttgaaaa caggatggta aaacgaatca cctgaatcta gaagctgtac 
2941 cttgtcccat aaaagtttta atttactgag cctttcggtc aagtaaacta gtttatctag 
3001 ttttgaaccg aatattgtgg gcagatttgc agtaagttca gttagatcta ctaaaagttg 
3061 tttgacagca gccgattcca caaaaatttg gtaaaaggag atgaaagaga cctcgcgcgt 
3121 aatggtttgc atcaccatcg gatgtctgtt gaaaaactca ctttttgcat ggaagttatt 
3181 aacaataaga ctaatgatta ccttagaata atgtataa 

Yeast MLH1 protein (accession number U07187) 

MSLRIKALDASVVNKIAAGEIIISPVNALKEMMENSIDANATMI
DILVKEGGIKVLQITDNGSGINKADLPILCERFTTSKLQKFEDLSQIQTYGFRGEALA 
SISHVARVTVTTKVKEDRCAWRVSYAEGKMLESPKPVAGKDGTTILVEDLFFNIPSRL 
RALRSHNDEYSKILDVVGRYAIHSKDIGFSCKKFGDSNYSLSVKPSYTVQDRIRTVFN 
KSVASNLITFHISKVEDLNLESVDGKVCNLNFISKKSISLIFFIKNRLVTCDLLRRAL 
NSVYSNYLPKGFRPFIYLGIVIDPAAVDVKVHPTKREVRFLSQDEIIEKIANQLHAEL 
SAIDTSRTFKASSISTNKPESLIPFNDTIESDRNRKSLRQAQVVENSYTTANSQLRKA 
KRQENKLVRIDASQAKITSFLSSSQQFNFEGSSTKRQLSEPKVTNVSHSQEAEKLTLN 
ESEQPRDAKTIKDNDLKDQPKKKQKLGDYKVPSIADDEKNALPISKDGYIRVPKERVN 
VNLTSIKKLREKVDDSIHRELTDIFANLNYVGVVDEERRLAAIQHDLKLFLIDYGSVC 
YELFYQIGLTDFANFGKINLQSTNVSDDIVLYNLLSEFDELNDDASKEKIISKIWDMS 
SMLNEYYSIELVNDGLDNDLKSVKLKSLPLLLKGYIPSLVKLPFFIYRLGKEVDWEDE 
QECLDGILREIALLYIPDMVPKVDTLDASLSEDEKAQFINRKEHISSLLEHVLFPCIK 
RRFLAPRHILKDVVEIANLPDLYKVFERC 
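
The relationship between the yeast MLH1 cDNA and protein listings above can be checked with a short translation sketch. It assumes the standard genetic code and that the open reading frame begins at the atg at position 241 of the cDNA listing, as the listing itself suggests. The helper names are illustrative, and only the first 60 nucleotides of the reading frame are used.

```python
from itertools import product

# Standard genetic code in the conventional t, c, a, g ordering.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cdna: str) -> str:
    """Translate a cDNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(cdna.lower().split())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Positions 241-300 of the yeast MLH1 cDNA listing, copied from the text.
orf_start = "atgtctctca gaataaaagc acttgatgca tcagtggtta acaaaattgc tgcaggtgag"
print(translate(orf_start))
```

The same function can be applied to the other cDNA listings once the start position of each reading frame is known. Characters other than a, c, g, and t raise a KeyError in this sketch.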



Mouse PMS2 protein 

MEQTEGVSTE CAKAIKPIDG KSVHQICSGQ VILSLSTAVK ELIENSVDAG ATTIDLRLKD 60
YGVDLIEVSD NGCGVEEEKF EGLALKHHTS KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120
TISTCHGSAS VGTRLVFDHN GKITQKTPYP RPKGTTVSVQ HLFYTLPVRY KEFQRNIKKE 180
YSKMVQVLQA YCIISAGVRV SCTNQLGQGK RHAVVCTSGT SGMKENIGSV FGQKQLQSLI 240
PFVQLPPSDA VCEEYGLSTS GRHKTFSTFR ASFHSARTAP GGVQQTGSFS SSIRGPVTQQ 300
RSLSLSMRFY HMYNRHQYPF VVLNVSVDSE CVDINVTPDK RQILLQEEKL LLAVLKTSLI 360
GMFDSDANKL NVNQQPLLDV EGNLVKLHTA ELEKPVPGKQ DNSPSLKSTA DEKRVASISR 420
LREAFSLHPT KEIKSRGPET AELTRSFPSE KRGVLSSYPS DVISYRGLRG SQDKLVSPTD 480
SPGDCMDREK IEKDSGLSST SAGSEEEFST PEVASSFSSD YNVSSLEDRP SQETINCGDL 540
DCRPPGTGQS LKPEDHGYQC KALPLARLSP TNAKRFKTEE RPSNVNISQR LPGPQSTSAA 600
EVDVAIKMNK RIVLLEFSLS SLAKRMKQLQ HLKAQNKHEL SYRKFRAKIC PGENQAAEDE 660
LRKEISKSMF AEMEILGQFN LGFIVTKLKE DLFLVDQHAA DEKYNFEMLQ QHTVLQAQRL 720
ITPQTLNLTA VNEAVLIENL EIFRKNGFDF VIDEDAPVTE RAKLISLPTS KNWTFGPQDI 780
DELIFMLSDS PGVMCRPSRV RQMFASRACR KSVMIGTALN ASEMKKLITH MGEMDHPWNC 840
PHGRPTMRHV ANLDVISQN 859



Mouse PMS2 cDNA 

gaattccggt gaaggtcctg aagaatttcc agattcctga gtatcattgg aggagacaga 60
taacctgtcg tcaggtaacg atggtgtata tgcaacagaa atgggtgttc ctggagacgc 120
gtcttttccc gagagcggca ccgcaactct cccgcggtga ctgtgactgg aggagtcctg 180
catccatgga gcaaaccgaa ggcgtgagta cagaatgtgc taaggccatc aagcctattg 240
atgggaagtc agtccatcaa atttgttctg ggcaggtgat actcagttta agcaccgctg 300
tgaaggagtt gatagaaaat agtgtagatg ctggtgctac tactattgat ctaaggctta 360
aagactatgg ggtggacctc attgaagttt cagacaatgg atgtggggta gaagaagaaa 420
actttgaagg tctagctctg aaacatcaca catctaagat tcaagagttt gccgacctca 480
cgcaggttga aactttcggc tttcgggggg aagctctgag ctctctgtgt gcactaagtg 540
atgtcactat atctacctgc cacgggtctg caagcgttgg gactcgactg gtgtttgacc 600
ataatgggaa aatcacccag aaaactccct acccccgacc taaaggaacc acagtcagtg 660
tgcagcactt attttataca ctacccgtgc gttacaaaga gtttcagagg aacattaaaa 720
aggagtattc caaaatggtg caggtcttac aggcgtactg tatcatctca gcaggcgtcc 780
gtgtaagctg cactaatcag ctcggacagg ggaagcggca cgctgtggtg tgcacaagcg 840
gcacgtctgg catgaaggaa aatatcgggt ctgtgtttgg ccagaagcag ttgcaaagcc 900
tcattccttt tgttcagctg ccccctagtg acgctgtgtg tgaagagtac ggcctgagca 960
cttcaggacg ccacaaaacc ttttctacgt ttcgggcttc atttcacagt gcacgcacgg 1020
cgccgggagg agtgcaacag acaggcagtt tttcttcatc aatcagaggc cctgtgaccc 1080
agcaaaggtc tctaagcttg tcaatgaggt tttatcacat gtataaccgg catcagtacc 1140
catttgtcgt ccttaacgtt tccgttgact cagaatgtgt ggatattaat gtaactccag 1200
ataaaaggca aattctacta caagaagaga agctattgct ggccgtttta aagacctcct 1260
tgataggaat gtttgacagt gatgcaaaca agcttaatgt caaccagcag ccactgctag 1320
atgttgaagg taacttagta aagctgcata ctgcagaact agaaaagcct gtgccaggaa 1380
agcaagataa ctctccttca ctgaagagca cagcagacga gaaaagggta gcatccatct 1440
ccaggctgag agaggccttt tctcttcatc ctactaaaga gatcaagtct aggggtccag 1500
agactgctga actgacacgg agttttccaa gtgagaaaag gggcgtgtta tcctcttatc 1560
cttcagacgt catctcttac agaggcctcc gtggctcgca ggacaaattg gtgagtccca 1620
cggacagccc tggtgactgt atggacagag agaaaataga aaaagactca gggctcagca 1680
gcacctcagc tggctctgag gaagagttca gcaccccaga agtggccagt agctttagca 1740
gtgactataa cgtgagctcc ctagaagaca gaccttctca ggaaaccata aactgtggtg 1800
acctggactg ccgtcctcca ggtacaggac agtccttgaa gccagaagac catggatatc 1860
aatgcaaagc tctacctcta gctcgtctgt cacccacaaa tgccaagcgc ttcaagacag 1920
aggaaagacc ctcaaatgtc aacatttctc aaagattgcc tggtcctcag agcacctcag 1980
cagctgaggt cgatgtagcc ataaaaatga ataagagaat cgtgctcctc gagttctctc 2040
tgagttctct agctaagcga atgaagcagt tacagcacct aaaggcgcag aacaaacatg 2100
aactgagtta cagaaaattt agggccaaga tttgccctgg agaaaaccaa gcagcagaag 2160
atgaactcag aaaagagatt agtaaatcga tgtttgcaga gatggagatc ttgggtcagt 2220
ttaacctggg atttatagta accaaactga aagaggacct cttcctggtg gaccagcatg 2280
ctgcggatga gaagtacaac tttgagatgc tgcagcagca cacggtgctc caggcgcaga 2340
ggctcatcac accccagact ctgaacttaa ctgctgtcaa tgaagctgta ctgatagaaa 2400
atctggaaat attcagaaag aatggctttg actttgtcat tgatgaggat gctccagtca 2460
ctgaaagggc taaattgatt tccttaccaa ctagtaaaaa ctggaccttt ggaccccaag 2520
atatagatga actgatcttt atgttaagtg acagccctgg ggtcatgtgc cggccctcac 2580
gagtcagaca gatgtttgct tccagagcct gtcggaagtc agtgatgatt ggaacggcgc 2640
tcaatgcgag cgagatgaag aagctcatca cccacatggg tgagatggac cacccctgga 2700
actgccccca cggcaggcca accatgaggc acgttgccaa tctggatgtc atctctcaga 2760
actgacacac cccttgtagc atagagttta ttacagattg ttcggtttgc aaagagaagg 2820
ttttaagtaa tctgattatc gttgtacaaa aattagcatg ctgctttaat gtactggatc 2880
catttaaaag cagtgttaag gcaggcatga tggagtgttc ctctagctca gctacttggg 2940
tgatccggtg ggagctcatg tgagcccagg actttgagac cactccgagc cacattcatg 3000
agactcaatt caaggacaaa aaaaaaaaga tatttttgaa gccttttaaa aaaaaa 3056

human PMS2 protein 

MKQLPAATVR LLSSSQIITS VVSVVKELIE NSLDAGATSV DVKLENYGFD KIEVRDNGEG 60
IKAVDAPVMA MKYYTSKINS HEDLENLTTY GFRGEALGSI CCIAEVLITT RTAADNFSTQ 120
YVLDGSGHIL SQKPSHLGQG TTVTALRLFK NLPVRKQFYS TAKKCKDEIK KIQDLLMSFG 180
ILKPDLRIVF VHNKAVIWQK SRVSDHKMAL MSVLGTAVMN NMESFQYHSE ESQIYLSGFL 240
PKCDADHSFT SLSTPERSFI FINSRPVHQK DILKLIRHHY NLKCLKESTR LYPVFFLKID 300
VPTADVDVNL TPDKSQVLLQ NKESVLIALE NLMTTCYGPL PSTNSYENNK TDVSAADIVL 360
SKTAETDVLF NKVESSGKNY SNVDTSVIPF QNDMHNDESG KNTDDCLNHQ ISIGDFGYGH 420
CSSEISNIDK NTKNAFQDIS MSNVSWENSQ TEYSKTCFIS SVKHTQSENG NKDHIDESGE 480
NEEEAGLENS SEISADEWSR GNILKNSVGE NIEPVKILVP EKSLPCKVSN NNYPIPEQMN 540
LNEDSCNKKS KVIDNKSGKV TAYDLLSNRV IKKPMSASAL FVQDHRPQFL IENPKTSLED 600
ATLQIEELWK TLSEEEKLKY EEKATKDLER YNSQMKRAIE QESQMSLKDG RKKIKPTSAW 660
NLAQKHKLKT SLSNQPKLDE LLQSQIEKRR SQNIKMVQIP FSMKNLKINF KKQNKVDLEE 720
KDEPCLIHNL RFPDAWLMTS KTEVMLLNPY RVEEALLFKR LLENHKLPAE PLEKPIMLTE 780
SLFNGSHYLD VLYKMTADDQ RYSGSTYLSD PRLTANGFKI KLIPGVSITE NYLEIEGMAN 840
CLPFYGVADL KEILNAILNR NAKEVYECRP RKVISYLEGE AVRLSRQLPM YLSKEDIQDI 900
IYRMKHQFGN EIKECVHGRP FFHHLTYLPE TT 932

Human PMS2 cDNA 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60
aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120
ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180
aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240
tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300
caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360
tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420
actcgactga tgtttgatca caatgggaaa attatccaga aaacccccta cccccgcccc 480
agagggacca cagtcagcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 540
tttcaaagga atattaagaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 600
atcatttcag caggcatccg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 660
cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 720
cagaagcagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 780
gaagagtacg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 840
atttcacaat gcacgcatgg agttggaagg agttcaacag acagacagtt tttctttatc 900
aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ctaccacatg 960
tataatcgac accagtatcc atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1020
gatatcaatg ttactccaga taaaaggcaa attttgctac aagaggaaaa gcttttgttg 1080
gcagttttaa agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1140
agtcagcagc cactgctgga tgttgaaggt aacttaataa aaatgcatgc agcggatttg 1200
gaaaagccca tggtagaaaa gcaggatcaa tccccttcat taaggactgg agaagaaaaa 1260
aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1320
aagcctcaca gcccaaagac tccagaacca agaaggagcc ctctaggaca gaaaaggggt 1380
atgctgtctt ctagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 1440
gaggcagtga gttccagtca cggacccagt gaccctacgg acagagcgga ggtggagaag 1500
gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 1560
agtcactgca gcagcgagta tgcggccagc tccccagggg acaggggctc gcaggaacat 1620
gtggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt ggactgccat 1680
tcaaaccagg aagataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 1740
accccaaaca caaagcgttt taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1800
aagttagtaa atactcagga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 1860
aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 1920
catcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1980
tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2040
tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2100
gaggatatct tcatagtgga ccagcatgcc acggacgaga agtataactt cgagatgctg 2160
cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ctcagactct caacttaact 2220
gctgttaatg aagctgttct gatagaaaat ctggaaatat ttagaaagaa tggctttgat 2280
tttgttatcg atgaaaatgc tccagtcact gaaagggcta aactgatttc cttgccaact 2340
agtaaaaact ggaccttcgg accccaggac gtcgatgaac tgatcttcat gctgagcgac 2400
agccctgggg tcatgtgccg gccttcccga gtcaagcaga tgtttgcctc cagagcctgc 2460
cggaagtcgg tgatgattgg gactgctctt aacacaagcg agatgaagaa actgatcacc 2520
cacatggggg agatggacca cccctggaac tgtccccatg gaaggccaac catgagacac 2580
atcgccaacc tgggtgtcat ttctcagaac tgaccgtagt cactgtatgg aataattggt 2640
tttatcgcag atttttatgt tttgaaagac agagtcttca ctaacctttt ttgttttaaa 2700
atgaaacctg ctacttaaaa aaaatacaca tcacacccat ttaaaagtga tcttgagaac 2760
cttttcaaac c 2771



human PMS1 protein 
MKQLPAATVR LLSSSQIITS VVSVVKELIE NSLDAGATSV DVKLENYGFD KIEVRDNGEG 60
IKAVDAPVMA MKYYTSKINS HEDLENLTTY GFRGEALGSI CCIAEVLITT RTAADNFSTQ 120
YVLDGSGHIL SQKPSHLGQG TTVTALRLFK NLPVRKQFYS TAKKCKDEIK KIQDLLMSFG 180
ILKPDLRIVF VHNKAVIWQK SRVSDHKMAL MSVXGTAVMN NMESFQYHSE ESQIYLSGFL 240
PKCDADHSFT SLSTPERSFI FINSRPVHQK DILKLIRHHY NLKCLKESTR LYPVFFLKID 300
VPTADVDVNL TPDKSQVLLQ NKESVLIALE NLMTTCYGPL PSTNSYENNK TDVSAADIVL 360
SKTAETDVLF NKVESSGKNY SNVDTSVIPF QNDMHNDESG KNTDDCLNHQ ISIGDFGYGH 420
CSSEISNIDK NTKNAFQDIS MSNVSWENSQ TEYSKTCFIS SVKHTQSENG NKDHIDESGE 480
NEEEAGLENS SEISADEWSR GNILKNSVGE NIEPVKILVP EKSLPCKVSN NNYPIPEQMN 540
LNEDSCNKKS NVIDNKSGKV TAYDLLSNRV IKKPMSASAL FVQDHRPQFL IENPKTSLED 600
ATLQIEELWK TLSEEEKLKY EEKATKDLER YNSQMKRAIE QESQMSLKDG RKKIKPTSAW 660
NLAQKHKLKT SLSNQPKLDE LLQSQIEKRR SQNIKMVQIP FSMKNLKIMF KKQNKVDLEE 720
KDEPCLIHNL RFPDAWLMTS KTEVMLLNPY RVEEALLFKR LLENHKLPAE PLEKPIMLTE 780
SLFNGSHYLD VLYKMTADDQ RYSGSTYLSD PRLTANGFKI KLIPGVSITE NYLEIEGMAN 840
CLPFYGVADL KEILNAILNR NAKEVYECRP RKVISYLEGE AVRLSRQLPM YLSKEDIQDI 900
IYRMKHQFGN EIKECVHGRP FFHHLTYLPE TT 932



Human PMS1 cDNA 

ggcacgagtg gctgcttgcg gctagtggat ggtaattgcc tgcctcgcgc tagcagcaag 60
ctgctctgtt aaaagcgaaa atgaaacaat tgcctgcggc aacagttcga ctcctttcaa 120
gttctcagat catcacttcg gtggtcagtg ttgtaaaaga gcttattgaa aactccttgg 180
atgctggtgc cacaagcgta gatgttaaac tggagaacta tggatttgat aaaattgagg 240
tgcgagataa cggggagggt atcaaggctg ttgatgcacc tgtaatggca atgaagtact 300
acacctcaaa aataaatagt catgaagatc ttgaaaattt gacaacttac ggttttcgtg 360
gagaagcctt ggggtcaatt tgttgtatag ctgaggtttt aattacaaca agaacggctg 420
ctgataattt tagcacccag tatgttttag atggcagtgg ccacatactt tctcagaaac 480
cttcacatct tggtcaaggt acaactgtaa ctgctttaag attatttaag aatctacctg 540
taagaaagca gttttactca actgcaaaaa aatgtaaaga tgaaataaaa aagatccaag 600
atctcctcat gagctttggt atccttaaac ctgacttaag gattgtcttt gtacataaca 660
aggcagttat ttggcagaaa agcagagtat cagatcacaa gatggctctc atgtcagttc 720
tggggactgc tgttatgaac aatatggaat cctttcagta ccactctgaa gaatctcaga 780
tttatctcag tggatttctt ccaaagtgtg atgcagacca ctctttcact agtctttcaa 840
caccagaaag aagtttcatc ttcataaaca gtcgaccagt acatcaaaaa gatatcttaa 900
agttaatccg acatcattac aatctgaaat gcctaaagga atctactcgt ttgtatcctg 960
ttttctttct gaaaatcgat gttcctacag ctgatgttga tgtaaattta acaccagata 1020
aaagccaagt attattacaa aataaggaat ctgttttaat tgctcttgaa aatctgatga 1080
cgacttgtta tggaccatta cctagtacaa attcttatga aaataataaa acagatgttt 1140
ccgcagctga catcgttctt agtaaaacag cagaaacaga tgtgcttttt aataaagtgg 1200
aatcatctgg aaagaattat tcaaatgttg atacttcagt cattccattc caaaatgata 1260
tgcataatga tgaatctgga aaaaacactg atgattgttt aaatcaccag ataagtattg 1320
gtgactttgg ttatggtcat tgtagtagtg aaatttctaa cattgataaa aacactaaga 1380
atgcatttca ggacatttca atgagtaatg tatcatggga gaactctcag acggaatata 1440
gtaaaacttg ttttataagt tccgttaagc acacccagtc agaaaatggc aataaagacc 1500
atatagatga gagtggggaa aatgaggaag aagcaggtct tgaaaactct tcggaaattt 1560
ctgcagatga gtggagcagg ggaaatatac ttaaaaattc agtgggagag aatattgaac 1620
ctgtgaaaat tttagtgcct gaaaaaagtt taccatgtaa agtaagtaat aataattatc 1680
caatccctga acaaatgaat cttaatgaag attcatgtaa caaaaaatca aatgtaatag 1740
ataataaatc tggaaaagtt acagcttatg atttacttag caatcgagta atcaagaaac 1800
ccatgtcagc aagtgctctt tttgttcaag atcatcgtcc tcagtttctc atagaaaatc 1860
ctaagactag tttagaggat gcaacactac aaattgaaga actgtggaag acattgagtg 1920
aagaggaaaa actgaaatat gaagagaagg ctactaaaga cttggaacga tacaatagtc 1980
aaatgaagag agccattgaa caggagtcac aaatgtcact aaaagatggc agaaaaaaga 2040
taaaacccac cagcgcatgg aatttggccc agaagcacaa gttaaaaacc tcattatcta 2100
atcaaccaaa acttgatgaa ctccttcagt cccaaattga aaaaagaagg agtcaaaata 2160
ttaaaatggt acagatcccc ttttctatga aaaacttaaa aataaatttt aagaaacaaa 2220
acaaagttga cttagaagag aaggatgaac cttgcttgat ccacaatctc aggtttcctg 2280
atgcatggct aatgacatcc aaaacagagg taatgttatt aaatccatat agagtagaag 2340
aagccctgct atttaaaaga cttcttgaga atcataaact tcctgcagag ccactggaaa 2400
agccaattat gttaacagag agtcttttta atggatctca ttatttagac gttttatata 2460
aaatgacagc agatgaccaa agatacagtg gatcaactta cctgtctgat cctcgtctta 2520
cagcgaatgg tttcaagata aaattgatac caggagtttc aattactgaa aattacttgg 2580
aaatagaagg aatggctaat tgtctcccat tctatggagt agcagattta aaagaaattc 2640
ttaatgctat attaaacaga aatgcaaagg aagtttatga atgtagacct cgcaaagtga 2700
taagttattt agagggagaa gcagtgcgtc tatccagaca attacccatg tacttatcaa 2760
aagaggacat ccaagacatt atctacagaa tgaagcacca gtttggaaat gaaattaaag 2820
agtgtgttca tggtcgccca ttttttcatc atttaaccta tcttccagaa actacatgat 2880
taaatatgtt taagaagatt agttaccatt gaaattggtt ctgtcataaa acagcatgag 2940
tctggtttta aattatcttt gtattatgtg tcacatggtt attttttaaa tgaggattca 3000
ctgacttgtt tttatattga aaaaagttcc acgtattgta gaaaacgtaa ataaactaat 3060
aac 3063



human MSH2 protein 

MAVQPKETLQ LESAAEVGFV RFFQGMPEKP TTTVRLFDRG DFYTAHGEDA LLAAREVFKT 60
QGVIKYMGPA GAKNLQSVVL SKMNFESFVK DLLLVRQYRV EVYKNRAGNK ASKENDWYLA 120
YKASPGNLSQ FEDILFGNND MSASIGVVGV KMSAVDGQRQ VGVGYVDSIQ RKLGLCEFPD 180
NDQFSNLEAL LIQIGPKECV LPGGETAGDM GKLRQIIQRG GILITERKKA DFSTKDIYQD 240
LNRLLKGKKG EQMNSAVLPE MENQVAVSSL SAVIKFLELL SDDSNFGQFE LTTFDFSQYM 300
KLDIAAVRAL NLFQGSVEDT TGSQSLAALL NKCKTPQGQR LVNQWIKQPL MDKNRIEERL 360
NLVEAFVEDA ELRQTLQEDL LRRFPDLNRL AKKFQRQAAN LQDCYRLYQG INQLPNVIQA 420
LEKHEGKHQK LLLAVFVTPL TDLRSDFSKF QEMIETTLDM DQVENHEFLV KPSFDPNLSE 480
LREIMNDLEK KMQSTLISAA RDLGLDPGKQ IKLDSSAQFG YYFRVTCKEE KVLRNNKNFS 540
TVDIQKNGVK FTNSKLTSLN EEYTKNKTEY EEAQDAIVKE IVNISSGYVE PMQTLNDVLA 600
QLDAVVSFAH VSNGAPVPYV RPAILEKGQG RIILKASRHA CVEVQDEIAF IPNDVYFEKD 660
KQMFHIITGP NMGGKSTYIR QTGVIVLMAQ IGCFVPCESA EVSIVDCILA RVGAGDSQLK 720
GVSTFMAEML ETASILRSAT KDSLIIIDEL GRGTSTYDGF GLAWAISEYI ATKIGAFCMF 780
ATHFHELTAL ANQIPTVNNL HVTALTTEET LTMLYQVKKG VCDQSFGIHV AELANFPKHV 840
IECAKQKALE LEEFQYIGES QGYDIMEPAA KKCYLEREQG EKIIQEFLSK VKQMPFTEMS 900
EENITIKLKQ LKAEVIAKNN SFVNEIISRI KVTT 934

Human MSH2 cDNA 

ggcgggaaac agcttagtgg gtgtggggtc gcgcattttc ttcaaccagg aggtgaggag 60 
gtttcgacat ggcggtgcag ccgaaggaga cgctgcagtt ggagagcgcg gccgaggtcg 120 
gcttcgtgcg cttctttcag ggcatgccgg agaagccgac caccacagtg cgccttttcg 180 
accggggcga cttctatacg gcgcacggcg aggacgcgct gctggccgcc cgggaggtgt 240 
tcaagaccca gggggtgatc aagtacatgg ggccggcagg agcaaagaat ctgcagagtg 300 
ttgtgcttag taaaatgaat tttgaatctt ttgtaaaaga tcttcttctg gttcgtcagt 360 
atagagttga agtttataag aatagagctg gaaataaggc atccaaggag aatgattggt 420 
atttggcata taaggcttct cctggcaatc tctctcagtt tgaagacatt ctctttggta 480 
acaatgatat gtcagcttcc attggtgttg tgggtgttaa aatgtccgca gttgatggcc 540 
agagacaggt tggagttggg tatgtggatt ccatacagag gaaactagga ctgtgtgaat 600 
tccctgataa tgatcagttc tccaatcttg aggctctcct catccagatt ggaccaaagg 660 
aatgtgtttt acccggagga gagactgctg gagacatggg gaaactgaga cagataattc 720 
aaagaggagg aattctgatc acagaaagaa aaaaagctga cttttccaca aaagacattt 780 
atcaggacct caaccggttg ttgaaaggca aaaagggaga gcagatgaat agtgctgtat 840 
tgccagaaat ggagaatcag gttgcagttt catcactgtc tgcggtaatc aagtttttag 900 
aactcttatc agatgattcc aactttggac agtttgaact gactactttt gacttcagcc 960 
agtatatgaa attggatatt gcagcagtca gagcccttaa cctttttcag ggttctgttg 1020 
aagataccac tggctctcag tctctggctg ccttgctgaa taagtgtaaa acccctcaag 1080 
gacaaagact tgttaaccag tggattaagc agcctctcat ggataagaac agaatagagg 1140 
agagattgaa tttagtggaa gcttttgtag aagatgcaga attgaggcag actttacaag 1200 
aagatttact tcgtcgattc ccagatctta accgacttgc caagaagttt caaagacaag 1260 
cagcaaactt acaagattgt taccgactct atcagggtat aaatcaacta cctaatgtta 1320 
tacaggctct ggaaaaacat gaaggaaaac accagaaatt attgttggca gtttttgtga 1380 
ctcctcttac tgatcttcgt tctgacttct ccaagtttca ggaaatgata gaaacaactt 1440 
tagatatgga tcaggtggaa aaccatgaat tccttgtaaa accttcattt gatcctaatc 1500 
tcagtgaatt aagagaaata atgaatgact tggaaaagaa gatgcagtca acattaataa 1560 
gtgcagccag agatcttggc ttggaccctg gcaaacagat taaactggat tccagtgcac 1620 
agtttggata ttactttcgt gtaacctgta aggaagaaaa agtccttcgt aacaataaaa 1680 
actttagtac tgtagatatc cagaagaatg gtgttaaatt taccaacagc aaattgactt 1740 
ctttaaatga agagtatacc aaaaataaaa cagaatatga agaagcccag gatgccattg 1800 
ttaaagaaat tgtcaatatt tcttcaggct atgtagaacc aatgcagaca ctcaatgatg 1860 
tgttagctca gctagatgct gttgtcagct ttgctcacgt gtcaaatgga gcacctgttc 1920 
catatgtacg accagccatt ttggagaaag gacaaggaag aattatatta aaagcatcca 1980 
ggcatgcttg tgttgaagtt caagatgaaa ttgcatttat tcctaatgac gtatactttg 2040 
aaaaagataa acagatgttc cacatcatta ctggccccaa tatgggaggt aaatcaacat 2100 
atattcgaca aactggggtg atagtactca tggcccaaat tgggtgtttt gtgccatgtg 2160 
agtcagcaga agtgtccatt gtggactgca tcttagcccg agtaggggct ggtgacagtc 2220 
aattgaaagg agtctccacg ttcatggctg aaatgttgga aactgcttct atcctcaggt 2280 
ctgcaaccaa agattcatta ataatcatag atgaattggg aagaggaact tctacctacg 2340 
atggatttgg gttagcatgg gctatatcag aatacattgc aacaaagatt ggtgcttttt 2400 
gcatgtttgc aacccatttt catgaactta ctgccttggc caatcagata ccaactgtta 2460 
ataatctaca tgtcacagca ctcaccactg aagagacctt aactatgctt tatcaggtga 2520 
agaaaggtgt ctgtgatcaa agttttggga ttcatgttgc agagcttgct aatttcccta 2580 
agcatgtaat agagtgtgct aaacagaaag ccctggaact tgaggagttt cagtatattg 2640 
gagaatcgca aggatatgat atcatggaac cagcagcaaa gaagtgctat ctggaaagag 2700 
agcaaggtga aaaaattatt caggagttcc tgtccaaggt gaaacaaatg ccctttactg 2760 
aaatgtcaga agaaaacatc acaataaagt taaaacagct aaaagctgaa gtaatagcaa 2820 
agaataatag ctttgtaaat gaaatcattt cacgaataaa agttactacg tgaaaaatcc 2880 
cagtaatgga atgaaggtaa tattgataag ctattgtctg taatagtttt atattgtttt 2940
atattaaccc tttttccata gtgttaactg tcagtgccca tgggctatca acttaataag 3000
atatttagta atattttact ttgaggacat tttcaaagat ttttattttg aaaaatgaga 3060
gctgtaactg aggactgttt gcaattgaca taggcaataa taagtgatgt gctgaatttt 3120
ataaataaaa tcatgtagtt tgtgg 3145



human MLH1 protein 
MSFVAGVIRR LDETVVNRIA AGEVIQRPAN AIKEMIENCL DAKSTSIQVI VKEGGLKLIQ 60
IQDNGTGIRK EDLDIVCERF TTSKLQSFED LASISTYGFR GEALASISHV AHVTITTKTA 120
DGKCAYRASY SDGKLKAPPK PCAGNQGTQI TVEDLFYKIA TRRKALKNPS EEYGKILEVV 180
GRYSVHNAGI SFSVKKQGET VADVRTLPNA STVDNIRSIF GNAVSRELIE IGCEDKTLAF 240
KMNGYISNAN YSVKKCIFLL FINHRLVEST SLRKAIETVY AAYLPKNTHP FLYLSLEISP 300
QNVDVNVHPT KHEVHFLHEE SILERVQQHI ESKLLGSNSS RMYFTQTLLP GLAGPSGEMV 360
KSTTSLTSSS TSGSSDKVYA HQMVRTDSRE QKLDAFLQPL SKPLSSQPQA IVTEDKTDIS 420
SGRARQQDEE MLELPAPAEV AAKNQSLEGD TTKGTSEMSE KRGPTSSNPR KRHREDSDVE 480
MVEDDSRKEM TAACTPRRRI INLTSVLSLQ EEINEQGHEV LREMLHNHSF VGCVNPQWAL 540
AQHQTKLYLL NTTKLSEELF YQILIYDFAN FGVLRLSEPA PLFDLAMLAL DSPESGWTEE 600
DGPKEGLAEY IVEFLKKKAE MLADYFSLEI DEEGNLIGLP LLIDNYVPPL EGLPIFILRL 660
ATEVNWDEEK ECFESLSKEC AMFYSIRKQY ISEESTLSGQ QSEVPGSIPN SWKWTVEHIV 720
YKALRSHILP PKHFTEDGNI LQLANLPDLY KVFERC 756



Human MLH1 cDNA 

cttggctctt ctggcgccaa aatgtcgttc gtggcagggg ttattcggcg gctggacgag 60
acagtggtga accgcatcgc ggcgggggaa gttatccagc ggccagctaa tgctatcaaa 120
gagatgattg agaactgttt agatgcaaaa tccacaagta ttcaagtgat tgttaaagag 180
ggaggcctga agttgattca gatccaagac aatggcaccg ggatcaggaa agaagatctg 240
gatattgtat gtgaaaggtt cactactagt aaactgcagt cctttgagga tttagccagt 300
atttctacct atggctttcg aggtgaggct ttggccagca taagccatgt ggctcatgtt 360
actattacaa cgaaaacagc tgatggaaag tgtgcataca gagcaagtta ctcagatgga 420
aaactgaaag cccctcctaa accatgtgct ggcaatcaag ggacccagat cacggtggag 480
gacctttttt acaacatagc cacgaggaga aaagctttaa aaaatccaag tgaagaatat 540
gggaaaattt tggaagttgt tggcaggtat tcagtacaca atgcaggcat tagtttctca 600
gttaaaaaac aaggagagac agtagctgat gttaggacac tacccaatgc ctcaaccgtg 660
gacaatattc gctccatctt tggaaatgct gttagtcgag aactgataga aattggatgt 720
gaggataaaa ccctagcctt caaaatgaat ggttacatat ccaatgcaaa ctactcagtg 780
aagaagtgca tcttcttact cttcatcaac catcgtctgg tagaatcaac ttccttgaga 840
aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc attcctgtac 900
ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg tgcaccccac aaagcatgaa 960
gttcacttcc tgcacgagga gagcatcctg gagcgggtgc agcagcacat cgagagcaag 1020
ctcctgggct ccaattcctc caggatgtac ttcacccaga ctttgctacc aggacttgct 1080
ggcccctctg gggagatggt taaatccaca acaagtctga cctcgtcttc tacttctgga 1140
agtagtgata aggtctatgc ccaccagatg gttcgtacag attcccggga acagaagctt 1200
gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc agccccaggc cattgtcaca 1260
gaggataaga cagatatttc tagtggcagg gctaggcagc aagatgagga gatgcttgaa 1320
ctcccagccc ctgctgaagt ggctgccaaa aatcagagct tggaggggga tacaacaaag 1380
gggacttcag aaatgtcaga gaagagagga cctacttcca gcaaccccag aaagagacat 1440
cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaat gactgcagct 1500
tgtacccccc ggagaaggat cattaacctc actagtgttt tgagtctcca ggaagaaatt 1560




aatgagcagg gacatgaggt tctccgggag atgttgcata accactcctt cgtgggctgt 1620
gtgaatcctc agtgggcctt ggcacagcat caaaccaagt tataccttct caacaccacc 1680
aagcttagtg aagaactgtt ctaccagata ctcatttatg attttgccaa ttttggtgtt 1740
ctcaggttat cggagccagc accgctcttt gaccttgcca tgcttgcctt agatagtcca 1800
gagagtggct ggacagagga agatggtccc aaagaaggac ttgctgaata cattgttgag 1860
tttctgaaga agaaggctga gatgcttgca gactatttct ctttggaaat tgatgaggaa 1920
gggaacctga ttggattacc ccttctgatt gacaactatg tgcccccttt ggagggactg 1980
cctatcttca ttcttcgact agccactgag gtgaattggg acgaagaaaa ggaatgtttt 2040
gaaagcctca gtaaagaatg cgctatgttc tattccatcc ggaagcagta catatctgag 2100
gagtcgaccc tctcaggcca gcagagtgaa gtgcctggct ccattccaaa ctcctggaag 2160
tggactgtgg aacacattgt ctataaagcc ttgcgctcac acattctgcc tcctaaacat 2220
ttcacagaag atggaaatat cctgcagctt gctaacctgc ctgatctata caaagtcttt 2280
gagaggtgtt aaatatggtt atttatgcac tgtgggatgt gttcttcttt ctctgtattc 2340
cgatacaaag tgttgtatca aagtgtgata tacaaagtgt accaacataa gtgttggtag 2400
cacttaagac ttatacttgc cttctgatag tattccttta tacacagtgg attgattata 2460
aataaataga tgtgtcttaa cata 2484



hPMS2-134 protein 

MKQLPAATVR LLSSSQIITS VVSVVKELIE NSLDAGATSV DVKLENYGFD KIEVRDNGEG 60
IKAVDAPVMA MKYYTSKINS HEDLENLTTY GFRGEALGSI CCIAEVLITT RTAADNFSTQ 120
YVLDGSGHIL SQK 133



hPMS2-134 cDNA

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60
aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120
ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180
aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240
tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300
caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360
tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420
acttga 426



hMSH6 (human cDNA) ACCESSION U28946
MSRQSTLYSFFPKSPALSDANKASARASREGGRAAAAPGASPSP 

GGDAAWSEAGPGPRPLARSASPPKAKNLNGGLRRSVAPAAPTSCDFSPGDLVWAKMEG 
YPWWPCLVYNHPFDGTFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQ 
KGGHFYSAKPEILRAMQRADEALNKDKIKRLELAVCDEPSEPEEEEEMEVGTTYVTDK 
SEEDNEIESEEEVQPKTQGSRRSSRQIKKRRVISDSESDIGGSDVEFKPDTKEEGSSD 
EISSGVGDSESEGLNSPVKVARKRKRMVTGNGSLKRKSSRKETPSATKQATSISSETK 
NTLRAFSAPQNSESQAHVSGGGDDSSRPTVWYHETLEWLKEEKRRDEHRRRPDHPDFD 
ASTLYVPEDFLNSCTPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLV 
FMKGNWAHSGFPEIAFGRYSDSLVQKGYKVARVEQTETPEMMEARCRKMAHISKYDRV 
VRREICRIITKGTQTYSVLEGDPSENYSKYLLSLKEKEEDSSGHTRAYGVCFVDTSLG
KFFIGQFSDDRHCSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEGLIPG 
SQFWDASKTLRTLLEEEYFREKLSDGIGVMLPQVLKGMTSESDSIGLTPGEKSELALS 
ALGGCVFYLKKCLIDQELLSMANFEEYIPLDSDTVSTTRSGAIFTKAYQRMVLDAVTL 
NNLEIFLNGTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDL 
MVVPDKISEVVELLKKLPDLERLLSKIHNVGSPLKSQNHPDSRAIMYEETTYSKKKII
DFLSALEGFKVMCKIIGIMEEVADGFKSKILKQVISLQTKNPEGRFPDLTVELNRWDT
AFDHEKARKTGLITPKAGFDSDYDQALADIRENEQSLLEYLEKQRNRIGCRTIVYWGI 
GRNRYQLEIPENFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLK 
DCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMCRPVILLPEDTPPF 
LELKGSRHPCITKTFFGDDFIPNDILIGCEEEEQENGKAYCVLVTGPNMGGKSTLMRQ
AGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHA 
TAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAV 
RLGHMACMVENECEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKA 
REFEKMNQSLRLFREVCLASERSTVDAEAVHKLLTLIKEL



hPMSR2 (human cDNA) ACCESSION U38964 

1 ggcgctccta cctgcaagtg gctagtgcca agtgctgggc cgccgctcct gccgtgcatg 
61 ttggggagcc agtacatgca ggtgggctcc acacggagag gggcgcagac ccggtgacag 
121 ggctttacct ggtacatcgg catggcgcaa ccaaagcaag agagggtggc gcgtgccaga 
181 caccaacggt cggaaaccgc cagacaccaa cggtcggaaa ccgccaagac accaacgctc 
241 ggaaaccgcc agacaccaac gctcggaaac cgccagacac caaggctcgg aatccacgcc 
301 aggccacgac ggagggcgac tacctccctt ctgaccctgc tgctggcgtt cggaaaaaac 
361 gcagtccggt gtgctctgat tggtccaggc tctttgacgt cacggactcg acctttgaca 
421 gagccactag gcgaaaagga gagacgggaa gtattttttc cgccccgccc ggaaagggtg 
481 gagcacaacg tcgaaagcag ccgttgggag cccaggaggc ggggcgcctg tgggagccgt 
541 ggagggaact ttcccagtcc ccgaggcgga tccggtgttg catccttgga gcgagctgag 
601 aactcgagta cagaacctgc taaggccatc aaacctattg atcggaagtc agtccatcag 
661 atttgctctg ggccggtggt accgagtcta aggccgaatg cggtgaagga gttagtagaa 
721 aacagtctgg atgctggtgc cactaatgtt gatctaaagc ttaaggacta tggagtggat 
781 ctcattgaag tttcaggcaa tggatgtggg gtagaagaag aaaacttcga aggctttact 
841 ctgaaacatc acacatgtaa gattcaagag tttgccgacc taactcaggt ggaaactttt 
901 ggctttcggg gggaagctct gagctcactt tgtgcactga gtgatgtcac catttctacc 
961 tgccgtgtat cagcgaaggt tgggactcga ctggtgtttg atcactatgg gaaaatcatc 
1021 cagaaaaccc cctacccccg ccccagaggg atgacagtca gcgtgaagca gttattttct 
1081 acgctacctg tgcaccataa agaatttcaa aggaatatta agaagaaacg tgcctgcttc 
1141 cccttcgcct tctgccgtga ttgtcagttt cctgaggcct ccccagccat gcttcctgta 
1201 cagcctgtag aactgactcc tagaagtacc ccaccccacc cctgctcctt ggaggacaac 
1261 gtgatcactg tattcagctc tgtcaagaat ggtccaggtt cttctagatg atctgcacaa 
1321 atggttcctc tcctccttcc tgatgtctgc cattagcatt ggaataaagt tcctgctgaa 
1381 aatccaaaaa aaaaaaaaaa aaaaaaaa 



hPMSR2 (human protein) ACCESSION U38964
MAQPKQERVARARHQRSETARHQRSETAKTPTLGNRQTPTLGNR 

QTPRLGIHARPRRRATTSLLTLLLAFGKNAVRCALIGPGSLTSRTRPLTEPLGEKERR 
EVFFPPRPERVEHNVESSRWEPRRRGACGSRGGNFPSPRGGSGVASLERAENSSTEPA 
KAIKPIDRKSVHQICSGPVVPSLRPNAVKELVENSLDAGATNVDLKLKDYGVDLIEVS 
GNGCGVEEENFEGFTLKHHTCKIQEFADLTQVETFGFRGEALSSLCALSDVTISTCRV 
SAKVGTRLVFDHYGKIIQKTPYPRPRGMTVSVKQLFSTLPVHHKEFQRNIKKKRACFP 
FAFCRDCQFPEASPAMLPVQPVELTPRSTPPHPCSLEDNVITVFSSVKNGPGSSR 



hPMSR3 (human cDNA) ACCESSION U38979

1 tttttagaaa ctgatgttta ttttccatca accatttttc catgctgctt aagagaatat 
61 gcaagaacag cttaagacca gtcagtggtt gctcctaccc attcagtggc ctgagcagtg 
121 gggagctgca gaccagtctt ccgtggcagg ctgagcgctc cagtcttcag tagggaattg 
181 ctgaataggc acagagggca cctgtacacc ttcagaccag tctgcaacct caggctgagt 
241 agcagtgaac tcaggagcgg gagcagtcca ttcaccctga aattcctcct tggtcactgc 
301 cttctcagca gcagcctgct cttctttttc aatctcttca ggatctctgt agaagtacag 
361 atcaggcatg acctcccatg ggtgttcacg ggaaatggtg ccacgcatgc gcagaacttc 
421 ccgagccagc atccaccaca ttaaacccac tgagtgagct cccttgttgt tgcatgggat 
481 ggcaatgtcc acatagcgca gaggagaatc tgtgttacac agcgcaatgg taggtaggtt 
541 aacataagat gcctccgtga gaggcgaagg ggcggcggga cccgggcctg gcccgtatgt
601 gtccttggcg gcctagacta ggccgtcgct gtatggtgag ccccagggag gcggatctgg
661 gcccccagaa ggacacccgc ctggatttgc cccgtagccc ggcccgggcc cctcgggagc
721 agaacagcct tggtgaggtg gacaggaggg gacctcgcga gcagacgcgc gcgccagcga
781 cagcagcccc gccccggcct ctcgggagcc ggggggcaga ggctgcggag ccccaggagg
841 gtctatcagc cacagtctct gcatgtttcc aagagcaaca ggaaatgaac acattgcagg
901 ggccagtgtc attcaaagat gtggctgtgg atttcaccca ggaggagtgg cggcaactgg
961 accctgatga gaagatagca tacggggatg tgatgttgga gaactacagc catctagttt
1021 ctgtggggta tgattatcac caagccaaac atcatcatgg agtggaggtg aaggaagtgg
1081 agcagggaga ggagccgtgg ataatggaag gtgaatttcc atgtcaacat agtccagaac
1141 ctgctaaggc catcaaacct attgatcgga agtcagtcca tcagatttgc tctgggccag
1201 tggtactgag tctaagcact gcagtgaagg agttagtaga aaacagtctg gatgctggtg
1261 ccactaatat tgatctaaag cttaaggact atggagtgga tctcattgaa gtttcagaca
1321 atggatgtgg ggtagaagaa gaaaactttg aaggcttaat ctctttcagc tctgaaacat
1381 cacacatgta agattcaaga gtttgccgac ctaactgaag ttgaaacttt cggttttcag
1441 ggggaagctc tgagctcact gtgtgcactg agcgatgtca ccatttctac ctgccacgcg
1501 ttggtgaagg ttgggactcg actggtgttt gatcacgatg ggaaaatcat ccaggaaacc
1561 ccctaccccc accccagagg gaccacagtc agcgtgaagc agttattttc tacgctacct
1621 gtgcgccata aggaatttca aaggaatatt aagaagacgt gcctgcttcc ccttcgcctt
1681 ctgccgtgat tgtcagtttc ctgaggcctc cccagccatg cttcctgtac agcctgcaga
1741 actgtgagtc aattaaacct cttttcttca taaattaaaa aaaaa



hPMSR3 (human protein) ACCESSION U38979 



MCPWRPRLGRRCMVSPREADLGPQKDTRLDLPRSPARAPREQNS 

LGEVDRRGPREQTRAPATAAPPRPLGSRGAEAAEPQEGLSATVSACFQEQQEMNTLQG 
PVSFKDVAVDFTQEEWRQLDPDEKIAYGDVMLENYSHLVSVGYDYHQAKHHHGVEVKE 
VEQGEEPWIMEGEFPCQHSPEPAKAIKPIDRKSVHQICSGPVVLSLSTAVKELVENSL
DAGATNIDLKLKDYGVDLIEVSDNGCGVEEENFEGLISFSSETSHM



hPMSL9 (human cDNA) ACCESSION NM_005395 

1 atgtgtcctt ggcggcctag actaggccgt cgctgtatgg tgagccccag ggaggcggat 
61 ctgggccccc agaaggacac ccgcctggat ttgccccgta gcccggcccg ggcccctcgg 
121 gagcagaaca gccttggtga ggtggacagg aggggacctc gcgagcagac gcgcgcgcca 
181 gcgacagcag ccccgccccg gcctctcggg agccgggggg cagaggctgc ggagccccag 
241 gagggtctat cagccacagt ctctgcatgt ttccaagagc aacaggaaat gaacacattg 
301 caggggccag tgtcattcaa agatgtggct gtggatttca cccaggagga gtggcggcaa 
361 ctggaccctg atgagaagat agcatacggg gatgtgatgt tggagaacta cagccatcta 
421 gtttctgtgg ggtatgatta tcaccaagcc aaacatcatc atggagtgga ggtgaaggaa 
481 gtggagcagg gagaggagcc gtggataatg gaaggtgaat ttccatgtca acatagtcca 
541 gaacctgcta aggccatcaa acctattgat cggaagtcag tccatcagat ttgctctggg 
601 ccagtggtac tgagtctaag cactgcagtg aaggagttag tagaaaacag tctggatgct 
661 ggtgccacta atattgatct aaagcttaag gactatggag tggatctcat tgaagtttca 
721 gacaatggat gtggggtaga agaagaaaac tttgaaggct taatctcttt cagctctgaa 
781 acatcacaca tgtaa 



hPMSL9 (human protein) ACCESSION NM_005395 

MCPWRPRLGRRCMVSPREADLGPQKDTRLDLPRSPARAPREQNS

LGEVDRRGPREQTRAPATAAPPRPLGSRGAEAAEPQEGLSATVSACFQEQQEMNTLQG
PVSFKDVAVDFTQEEWRQLDPDEKIAYGDVMLENYSHLVSVGYDYHQAKHHHGVEVKE
VEQGEEPWIMEGEFPCQHSPEPAKAIKPIDRKSVHQICSGPVVLSLSTAVKELVENSL
DAGATNIDLKLKDYGVDLIEVSDNGCGVEEENFEGLISFSSETSHM